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(54) A method for increasing the concentration of a nucleic acid molecule 



(57) A method for increasing the concentration of a 
nucleic acid molecule, said method comprising: 

(a) forming aqueous microcapsules from a water- 
in-oil emulsion, wherein a plurality of the microcap- 
sules include a nucleic acid molecule and an aque- 
ous solution comprising components necessary for 
nucleic acid amplification; and 



(b) amplifying the nucleic acid molecule in the mi- 
crocapsules to form further amplified copies of said 
nucleic acid molecule. 
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Description 

[0001] The present invention relates to methods for use in in vitro evolution of molecular libraries. In particular the 
present invention relates to methods of. selecting nucleic acids encoding gene products in which the nucleic acid and 
the activity of the encoded gene product are linked by compartmentation. 

[0002] Evolution requires the generation of genetic diversity (diversity in nucleic acid) followed by the selection of 
those nucleic acids which result in beneficial characteristics. Because the nucleic acid and the activity of the encoded 
gene product of an organism are physically linked (the nucleic acids being confined within the cells which they encode) 
multiple rounds of mutation and selection can result in the progressive survival of organisms with increasing fitness. 
Systems for rapid evolution of nucleic acids or proteins in vitro must mimic this process at the molecular level in that 
the nucleic acid and the activity of the encoded gene product must be linked and the activity of the gene product must 
be selectable. 

[0003] Recent advances in molecular biology have allowed some molecules to be co-selected according to their 
properties along with the nucleic acids that encode them. The selected nucleic acids can subsequently be cloned for 
further analysis or use, or subjected to additional rounds of mutation and selection. 

[0004] Common to these methods is the establishment of large libraries of nucleic acids . Molecules having the desired 
characteristics (activity) can be isolated through seJection regimes that select for the desired activity of the encoded 
gene product, such as a desired biochemical or biological activity, for example binding activity. 
[0005] Phage display technology has been highly successful as providing a vehicle that allows for the selection of 
a displayed protein by providing the essential link between nucleic acid and the activity of the encoded gene product 
(Smith, 1985; Bass era/. , 1 990; McCafferty etal, 1990; for review see Clackson and Wells, 1994). Filamentous phage 
particles act as genetic display packages with proteins on the outside and the genetic elements which encode them 
on the inside. The tight linkage between nucleic acid and the activity of the encoded gene product is a result of the 
assembly of the phage within bacteria. As individual bacteria are rarely multiply infected, in most cases all the phage 
produced from an individual bacterium will carry the same genetic element and display the same protein. 
[0006] However, phage display relies upon the creation of nucleic acid libraries in vivo in bacteria. Thus, the practical 
limitation on library size allowed by phage display technology is of the order of 10 7 to 10 11 , even taking advantage of 
X phage vectors with excisable filamentous phage replicons. The technique has mainly been applied to selection of 
molecules with binding activity. A small number of proteins with catalytic activity have also been isolated using this 
technique, however, in no case was selection directly for the desired catalytic activity, but either for binding to a tran- 
sition-state analogue (Widersten and Mannervik, 1995) or reaction with a suicide inhibitor (Soumillion et a/., 1994; 
Janda etal., 1997). 

[0007] Specific peptide ligands have been selected for binding to receptors by affinity selection using large libraries 
of peptides linked to the C terminus of the lac repressor Lacl (Cull et at., 1 992). When expressed in E. colt the repressor 
protein physically links the ligand to the encoding plasmid by binding to a lac operator sequence on the plasmid. 
[0008] An entirely in vitro polysome display system has also been reported (Mattheakis et at., 1 994) in which nascent 
peptides are physically attached via the ribosome to the RNA which encodes them. 

[0009] However, the scope of the above systems is limited to the selection of proteins and furthermore does not 
allow direct selection for activities other than binding, for example catalytic or regulatory activity. 
[0010] In vitro RNA selection and evolution (Ellington and Szostak, 1990), sometimes referred to as SELEX (sys- 
tematic evolution of ligands by exponential enrichment) (Tuerk and Gold, 1990) allows for selection for both binding 
and chemical activity, but only for nucleic acids. When selection is for binding, a pool of nucleic acids is incubated with 
immobilised substrate. Non-binders are washed away, then the binders are released, amplified and the whole process 
is repeated in iterative steps to enrich for better binding sequences. This method can also be adapted to allow isolation 
of catalytic RNA and DNA (Green and Szostak, 1992; for reviews see Chapman and Szostak, 1 994; Joyce, 1 994; Gold 
etal, 1995; Moore, 1995). 

[001 1 ] However, selection for "catalytic" or binding activity using SELEX is only possible because the same molecule 
performs the dual role of carrying the genetic information and being the catalyst or binding molecule (aptamer). When 
selection is for -auto-catalysis" the same molecule must also perform the third role of being a substrate. Since the 
genetic element must play the role of both the substrate and the catalyst, selection is only possible for single turnover 
events. Because the "catalyst" is in this process itself modified, it is by definition not a true catalyst. Additionally, proteins 
may not be selected using the SELEX procedure. The range of catalysts, substrates and reactions which can be 
selected is therefore severely limited. 

[0012] Those of the above methods that allow for iterative rounds of mutation and selection are mimicking in vitro 
mechanisms usually ascribed to the process of evolution: iterative variation, progressive selection for a desired the 
activity and replication. However, none of the methods so far developed have provided molecules of comparable di- 
versity and functional efficacy to those that are found naturally. Additionally, there are no man-made "evolution" systems 
which can evolve both nucleic acids and proteins to effect the full range of biochemical and biological activities (for 
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example, binding, catalytic and regulatory activities) and that can combine several processes leading to a desired 
product or activity. ■ 

[001 3J There is thus a great need for an in vitro system that overcomes the limitations discussed above. 
5 BRIEF DESCRIPTION OF THE INVENTION 

[001 4J According to a first aspect of the present invention, there is provided a method for isolating one or more 
genetic elements encoding a gene product having a desired activity, comprising the steps of: 

10 (a) compartmentalising genetic elements into microcapsules; 

(b) expressing the genetic elements to produce their respective gene products within the microcapsules; 

(c) sorting the genetic elements which produce the gene product(s) having the desired activity. 

[001 5] The microcapsules according to the present invention compartmentalise genetic elements and gene products 
15 such that they remain physically linked together. Surprisingly, nucleic acid expression remains possible within the 
artificial microcapsules allowing for isolation of nucleic acid on the basis if the activity of the gene product which it 
. encodes. 

[001 6} As used herein, a genetic element is a molecule or molecular construct comprising a nucleic acid. The genetic 
elements of the present invention may comprise any nucleic acid (for example, DNA, RNA or any analogue, natural 
20 or artificial, thereof). The nucleic acid component of the genetic element may moreover be linked, covalently or non- 
-covalently, to one or more molecules or structures, including proteins, chemical entities and groups, solid-phase sup- 
ports such as magnetic beads, and the like. In the method of the invention, these structures or molecules can be 
designed to assist in the sorting and/or isolation of the genetic element encoding a gene product with the desired activity. 
[0017] Expression, as used herein, is used in its broadest meaning, to signify that a nucleic acid contained in the 
genetic element is converted into its gene product. Thus, where the nucleic acid is DNA, expression refers to the 
transcription of the DNA into RNA; where this RNA codes for protein, expression may also refer to the translation of 
the RNA into protein. Where the nucleic acid is RNA, expression may refer to the replication of this RNA into further 
RNA copies, the reverse transcription of the RNA into DNA and optionally the transcription of this DNA into further 
RNA molecule(s), as well as optionally the translation of any of the RNA species produced into protein. Preferably, 
therefore, expression is performed by one or more processes selected from the group consisting of transcription, re- 
verse transcription, replication and translation. 

[0018] Expression of the genetic element may thus be directed into either DNA, RNA or protein, or a nucleic acid or 
protein containing unnatural bases or amino acids (the gene product) within the microcapsule of the invention, so that 
the gene product is confined within the same microcapsule as the genetic element. 

[0019] The genetic element and the gene product thereby encoded are linked by confining each genetic element 
and the respective gene product encoded by the genetic element within the same microcapsule. In this way the gene 
product in one microcapsule cannot cause a change in any other microcapsules. 

[0020] The term "microcapsule" is used herein in accordance with the meaning normally assigned thereto in the art 
and further described hereinbelow. In essence, however, a microcapsule is an artificial compartment whose delimiting 
borders restrict the exchange of the components of the molecular mechanisms described herein which allow the sorting 
of genetic elements according to the function of the gene products which they encode. 

[0021 ] Preferably, the microcapsules used in the method of the present invention will be capable of being produced 
in very large numbers, and thereby to compartmentalise a library of genetic elements which encodes a repertoire of 
gene products. 

[0022] According to a preferred embodiment of the first aspect of the present invention, the sorting of genetic elements 
may be performed in one of essentially four techniques. 

(I) in a first embodiment, the microcapsules are sorted according to an activity of the gene product or derivative 
thereof which makes the microcapsule detectable as a whole. Accordingly, the invention provides a method ac- 
cording to the first aspect of the invention wherein a gene product with the desired activity induces a change in 
the microcapsule, or a modification of one or more molecules within the microcapsule, which enables the micro- 
capsule containing the gene product and the genetic element encoding it to be sorted. In this embodiment, there- 
fore, the microcapsules are physically sorted from each other according to the activity of the gene product(s) 
expressed from the genetic element(s) contained therein, which makes it possible selectively to enrich for micro- 
capsules containing gene products of the desired activity. 

(II) In a second embodiment, the genetic elements are sorted following pooling of the microcapsules into one or 
more common compartments. In this embodiment, a gene product having the desired activity modifies the genetic 
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element which encoded it (and which resides in the same microcapsule) in such a way as to make it selectable in 
a subsequent step. The reactions are stopped and the microcapsules are then broken so that all the contents of 
the individual microcapsules are pooled. Selection for the modified genetic elements enables enrichment of the 
genetic elements encoding the gene product(s) having the desired activity. Accordingly, the invention provides a 
method according to the first aspect of the invention, wherein in step (b) the gene product having the desired 
activity modifies the genetic element encoding it to enable the isolation of the genetic element. It is to be understood, 
of course, that modification may be direct, in that it is caused by the direct action of the gene product on the genetic 
element, or indirect, in which a series of reactions, one or more of which involve the gene product having the 
desired activity, leads to modification of the genetic element. 



(HI) in a third embodiment, the genetic elements are sorted following pooling of the microcapsules into one or more 
common compartments. In this embodiment, a gene with a desired activity induces a change in the microcapsule 
containing the gene product and the genetic element encoding it. This change, when detected, triggers the mod- 
ification of the gene within the compartment. The reactions are stopped and the microcapsules are then broken 

15 so that all the contents of the individual microcapsules are pooled. Selection for the modified genetic elements 

enables enrichment of the genetic elements encoding the gene product(s) having the desired activity. Accordingly 
the invention provides a method according to the first aspect of the invention, where in step (b) the gene product 
having the desired activity induces a change in the compartment which is detected and triggers the modification 
of the genetic element within the compartment so as to allow its isolation. It is to be understood that the detected 

20 change in the compartment may be caused by the direct action of the gene product, or indirect action, in which a 

series of reactions, one or more of which involve the gene product having the desired activity leads to the detected 
change. 

(IV) In a fourth embodiment, the genetic elements may be sorted by a multi-step procedure, which involves at least 

25 two steps, for example, in order to allow the exposure of the genetic elements to conditions which permit at least 

two separate reactions to occur. As will be apparent to a persons skilled in the art, the first microencapsulation 
step of the invention must result in conditions which permit the expression of the genetic elements - be it transcrip- 
tion, transcription and/or translation, replication or the like. Under these conditions, it may not be possible to select 
for a particular gene product activity, for example because the gene product may not be active under these con- 

30 ditions, or because the expression system contains an interfering activity. The invention therefore provides a meth- 

od according to the first aspect of the present invention, wherein step (b) comprises expressing the genetic ele- 
ments to produce their respective gene products within the microcapsules, linking the gene products to the genetic 
elements encoding them and isolating the complexes thereby formed. This allows for the genetic elements and 
their associated gene products to be isolated from the capsules before sorting according to gene product activity 

35 takes place. In a preferred embodiment, the complexes are subjected to a further compartmentalisation step prior 

to isolating the genetic elements encoding a gene product having the desired activity. This further compartmen- 
talisation step, which advantageously takes place in microcapsules, permits the performance of further reactions, 
under different conditions, in an environment where the genetic elements and their respective gene products are 
physically linked. Eventual sorting of genetic elements may be performed according to embodiment (I), (II) or (III) 

40 above. 

[0023] The "secondary encapsulation" may also be performed with genetic elements linked to gene products by other 
means, such as by phage display, polysome display, RNA-peptide fusion or lac repressor peptide fusion. 
[0024] The selected genetic element(s) may also be subjected to subsequent, possibly more stringent rounds of 
45 sorting in iteratively repeated steps, reapplying the method of the invention either in its entirety or in selected steps 
only. By tailoring the conditions appropriately, genetic elements encoding gene products having a better optimised 
activity may be isolated alter each round of selection. 

[0025] Additionally, the genetic elements isolated after a first round of sorting may be subjected to mutagenesis 
before repeating the sorting by iterative repetition of the steps of the method of the invention as set out above. After 
50 each round of mutagenesis, some genetic elements will have been modified in such a way that the activity of the gene 
products is enhanced. 

[0026] Moreover, the selected genetic elements can be cloned into an expression vector to allow further character- 
isation of the genetic elements and their products. 

[0027] In a second aspect, the invention provides a product when selected according to the first aspect of the inven- 
55 tion. As used in this context, a "product" may refer to a gene product, selectable according to the invention, or the 
genetic element (or genetic information comprised therein). 

[0028] In a third aspect, the invention provides a method for preparing a gene product, comprising the steps of: 
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(a) preparing a genetic element encoding the gene product; 

(b) compartmentalising genetic elements into microcapsules; 

(c) expressing the genetic elements to produce their respective gene products within the microcapsules; 

(d) sorting the genetic elements which produce the gene produces) having the desired activity; and " 

(e) expressing the gene product having the desired activity. 

[0029] In accordance with the third aspect, step (a) preferably comprises preparing a repertoire of genetic elements, 
wherein each genetic element encodes a potentially differing gene product. Repertoires may be generated by conven- 
tional techniques, such as those employed for the generation of libraries intended for selection by methods such as 
phage display. Gene products having the desired activity may be selected from the repertoire, according to the present 
invention. 

[0030] In a fourth aspect, the invention provides a method for screening a compound or compounds capable of 
modulation the activity of a gene product, comprising the steps of: 

15 (a) preparing a repertoire of genetic element encoding gene product; 

(b) compartmentalising genetic elements into microcapsules; 

(c) expressing the genetic elements to produce their respective gene products within the microcapsules; 

(d) sorting the genetic elements which produce the gene product(s) having the desired activity; and 

(e) contacting a gene product having the desired activity with the compound or compounds and monitoring the 
20 modulation of an activity of the gene product by the compound or compounds. 

Advantageously, the method further comprises the step of: 

(f) identifying the compound or compounds capable of modulating the activity of the gene product and synthesising 
said compound or compounds. 



25 



[0031 ] This selection system can be configured to select for RNA, DNA or protein molecules with catalytic, regulatory 
or binding activity. 



BRIEF DESCRIPTION OF THE FIGURES 
30 Figure 1 

Gene selection by compartmentalisation. 
[0032] 

35 

a Schematic representation of the selection procedure. In Step 1, an in vitro transcription/translation reaction 
mixture containing a library of genetic elements linked to a substrate for the reaction being selected is dispersed 
to form a water-in-oil emulsion with typically one genetic element per aqueous compartment. The genetic elements 
are transcribed and translated within their compartments (Step 2). Subsequently (Step 3), proteins (or RNAs) with 

to enzymatic activities convert the substrate into a product that remains linked to the genetic element. Compartmen- 

talisation prevents the modification of genetic elements in other compartments. Next (Step 4), the emulsion is 
broken, all reactions are stopped and the aqueous compartments combined. Genetic elements which are linked 
to the product are selectively enriched, then amplified, and either characterised (Step 5), or linked to the substrate 
and compartmentalised for further rounds ol selection (Step 6). 

45 b Selection for target-specific DNA methytation by Haetil methytase. The substrate is a segment of DNA 

containing Haeill restriction/modification (R/M) sites. Genetic elements are isolated by binding to streptavidin- 
coated magnetic beads and treated with the cognate restriction enzyme Haelll. Only nucleic acids with methylated 
R/M sites are resistant to cleavage and subsequently amplified by PCR. 

50 Figure 2a 

[0033] Droplet size distribution and activities of DHFR and Haelll methyfase in emulsions: size distribution of the 
aqueous compartments in an emulsion determined by laser diffraction. In vitro transcription/translation reaction mix- 
tures containing DNA and sodium deoxycholate are emulsified by stirring, or by stirring followed by homogenisation 
55 at 8k, 9.5k or 1 3.5k rpm. The size distribution of the aqueous particles is shown by percentage of the total aqueous 
volume. 
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Figure 2b 

[0034] The activity of DHFR formed in situ by transcription and translation of its gene (Fig, 1b) in aqueous compart- 
ments of an emulsion. The concentration of the to/A gene used (2.5 nM) gives an average of one gene pefdroplet in 
the finest emulsions (homogenised at 13.5 k rpm). The mean diameter calculated from the size distribution data (in 
Figure 2a) is presented as a function of the speed of homogenisation (Ok rpm refers to the emulsion prepared by stirring 
with no further homogenisation). Activity is presented as percentage of the activity observed in the non-emulsified in 
vitro reaction mixture under the same conditions. 

[0035] The activity of Haelll methytase formed in situ by transcription and translation of its gene (Fig. 1b) in aqueous 
compartments of an emulsion. The concentration of the M.HaelU gene used (2.5 nM) gives an average of one gene 
per droplet in the finest emulsions (homogen ised at 1 3.5 k rpm). The mean diameter calculated from the size distribution 
data (in Figure 2a) is presented as a function of the speed of homogenisation; (0k rpm refers to the emulsion prepared 
by stirring with no further homogenisation). Activity is presented as percentage of the activity observed in the non- 
emulsified in vitro reaction mixture under the same conditions. 

Figure 3 

Selections for Haelll DNA methytase. 
[0036] 

a Selecting MHaellt genes from a 1000-fold excess of folA genes. Reactions were set up with 0.2 nM of DIG-fo/A- 
3s-Biotin DNA (corresponding to an average of one gene per compartment), spiked with 0.2 pM of DlG-M.Haelll- 
3s-Biotin. Reaction mixtures were either emulsified by stirring or left in solution. The DNA from these reactions 
was captured, digested with Haelll (or with Hha\) and amplified by PCR. This DNA was further amplified by nested 
PCR with primers LMB2-Nest and LMB3-Nest and five microiitres of each nested PCR was electrophoresed on a 
1 .5% agarose gel containing ethidium bromide. Markers, <J>X1 74-Haelll digest; minus T7, no T7 RNA polymerase; 
minus NadCh, no sodium deoxycholate. 

30 b Two-round selections. Reactions containing a 1:1 0 4 to 1:10 7 molar ratio of DIG-M.Haelll-3s-Biotin : DIGfolA-3s- 

Biotin (at 500 pM) are emulsified by stirring. The DNA from these reactions is digested with Haelll and amplified 
by PCR with primers LMB2-Biotin (SEQ. ID. No. 9) and LMB3-DIG (SEQ. ID. NO. 10). The amplified DNA from 
the first round selection of 1 :1 0 4 and 1:10 s ratios (at 20 pM) and the 1 : 1 0 6 and 1:10 7 ratios (at 500 pM) is put into 
a second round of selection. This DNA was further amplified by nested PCR with primers LMB2-Nest and 

35 LMB3-Nest and five microiitres of nested PCR from each round of selection are analysed by gel electrophoresis 

as above (upper panel). The same DNA was translated in vitro and the resulting methylase activity was measured. 
Results are presented as the percentage of substrate DNA methylated (lower panel). 
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DETAILED DESCRIPTION OF THE INVENTION 

(A) GENERAL DESCRIPTION 

[0037] The microcapsules of the present invention require appropriate physical properties to allow the working of 
the invention. 

[0038] First, to ensure that the genetic elements and gene products may not diffuse between microcapsules, the 
contents of each microcapsule must be isolated from the contents of the surrounding microcapsules, so that there is 
no or little exchange of the genetic elements and gene products between the microcapsules over the timescale of the 
experiment. 

[0039] Second, the method of the present invention requires that there are only a limited number of genetic elements 
per microcapsule. This ensures that the gene product of an individual genetic element will be isolated from other genetic 
elements. Thus, coupling between genetic element and gene product will be highly specific. The enrichment factor is 
greatest with on average one or fewer genetic elements per microcapsule, the linkage between nucleic acid and the 
activity of the encoded gene product being as tight as is possible, since the gene product of an individual genetic 
element will be isolated from the products of all other genetic elements. However, even if the theoretically optimal 
situation of, on average, a single genetic element or less per microcapsule is not used, a ratio of 5, 1 0, 50, 1 00 or 1 000 
or more genetic elements per microcapsule may prove beneficial in sorting a large library. Subsequent rounds of sorting, 
including renewed encapsulation with differing genetic element distribution, will permit more stringent sorting of the 
genetic elements. Preferably, there is a single genetic element, or fewer, per microcapsule. 
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[0040] Third, the formation and the composition of the microcapsules must not abolish the function of the machinery 
the expression of the genetic elements and the activity of the gene products. 

[0041] Consequently any microencapsulation system used must fulfil these three requirements. The appropriate 
system(s) may vary depending on the precise nature of the requirements in each application of the invention, as will 
be apparent to the skilled person. ' . 

[0042] A wide variety of microencapsulation procedures are available (see Benita, 1 996) and may be used to create 
the microcapsules used in accordance with the present invention. Indeed, more than 200 microencapsulation methods 
have been identified in the literature (Finch, 1993). 

[0043] These include membrane enveloped aqueous vesicles such as lipid vesicles (liposomes) (New, 1990) and 
non-ionic surfactant vesicles (van Hal et aL, 1996). These are closed-membranous capsules of single or multiple bi- 
layers of non-covalently assembled molecules, with each bilayer separated from its neighbour by an aqueous com- 
partment. In the case of liposomes the membrane is composed of lipid molecules; these are usually phospholipids but 
sterols such as cholesterol may also be incorporated into the membranes (New, 1 990). A variety of enzyme-catalysed 
biochemical reactions, including RNA and DNA polymerisation, can be performed within liposomes (Chakrabarti et aL, 
1994; Oberholzer era/., 1995a; Oberholzer et aL, 1995b; Walde era/., 1994; Wick & Luisi, 1996). 
[0044] With a membrane-enveloped vesicle system much of the aqueous phase is outside the vesicles and is there- 
fore non-compartmentalised. This continuous, aqueous phase should be removed or the biological systems in it inhib- 
ited or destroyed (for example, by digestion of nucleic acids with DNase or RNase) in order that the reactions are 
limited to the microcapsules (Luisi et aL, 1987). 

[0045] Enzyme-catalysed biochemical reactions have also been demonstrated in microcapsules generated by a 
variety of other methods. Many enzymes are active in reverse micellar solutions (Bru & Walde, 1991; Bru & Walde, 
1993; Creagh et at., 1993, Haber et aL, 1993; Kumar et aL, 1989; Luisi & B., 1987; Mao & Walde, 1991; Mao et at ' 
1992; Perez et a/.,-1992; Walde et al., 1994; Walde etaL, 1993; Walde et aL, 1988) such as the AOT-isooctane-water 
system (Menger & Yamada, 1979). 

[0046] Microcapsules can also be generated by interracial polymerisation and interfacial complexation (Whateley, 
1 996). Microcapsules of this sort can have rigid, nonpermeable membranes, or semipermeable membranes. Semiper- 
meable microcapsules bordered by cellulose nitrate membranes, polyamide membranes and lipid-polyamide mem- 
branes can all support biochemical reactions, including multienzyme systems (Chang, 1 987: Chang, 1 992; Lim, 1 984). 
Alginate/polylysine microcapsules (Lim & Sun, 1980), which can be formed under very mild conditions, have also 
proven to be very biocompatible, providing, for example, an effective method of encapsulating living cells and tissues 
(Chang, 1992; Sun et aL, 1992). 

[0047] Non-membranous microencapsulation systems based on phase partitioning of an aqueous environment in a 
colloidal system, such as an emulsion, may also be used. 

[0048] Preferably, the microcapsules of the present invention are formed from emulsions; heterogeneous systems 
of two immiscible liquid phases with one of the phases dispersed in the other as droplets of microscopic or colloidal 
size (Becher, 1957; Sherman : 1968; Lissant, 1974; Lissant, 1984). 

[0049] Emulsions may be produced from any suitable combination of immiscible liquids. Preferably the emulsion of 
the present invention has water (containing the biochemical components) as the phase present in the form of finery 
divided droplets (the disperse, internal or discontinuous phase) and a hydrophobic, immiscible liquid (an 'oil 1 ) as the 
matrix in which these droplets are suspended (the nondisperse, continuous or external phase). Such emulsions are 
termed 'water-in-oil' (W/O). This has the advantage that the entire aqueous phase containing the biochemical compo- 
nents is compartmentalised in discreet droplets (the internal phase). The external phase, being a hydrophobic oil, 
generally contains none of the biochemical components and hence is inert. 

[0050] The emulsion may be stabilised by addition of one or more surface-active agents (surfactants). These sur- 
factants are termed emulsifying agents and act at the water/oil interface to prevent (or at least delay) separation of the 
phases. Many oils and many emulsifiers can be used for the generation of water-in-oil emulsions; a recent compilation 
listed over 1 6,000 surfactants, many of which are used as emulsifying agents (Ash and Ash, 1 993). Suitable oils include 
tight white mineral oil and non-ionic surfactants (Schick, 1966) such as sorbitan monooleate (Span™80; ICI) and poly- 
oxyethylenesorbitan monooleate (Tween™80; ICI). 

[0051 ] The use of anionic surfactants may also be beneficial. Suitable surfactants include sodium cholate and sodium 
taurocholate. Particularly preferred is sodium deoxycholate, preferably at a concentration of 0.5% w/v, or below. Inclu- 
sion of such surfactants can in some cases increase the expression of the genetic elements and/or the activity of the 
gene products. Addition of some anionic surfactants to a non-emulsified reaction mixture completely abolishes trans- 
lation. During emulsification, however, the surfactant is transferred from the aqueous phase into the interface and 
activity is restored. Addition of an anionic surfactant to the mixtures to be emulsified ensures that reactions proceed 
only after compartmentalisation. 

[0052] Creation of an emulsion generally requires the application of mechanical energy to force the phases together. 
There are a variety of ways of doing this which utilise a variety of mechanical devices, including stirrers (such as 
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magnetic stir-bars, propeller and turbine stirrers, paddle devices and whisks), homogenisers (including rotor-stator 
homogenisers, high-pressure valve homogenisers and jet homogenisers), colloid mills, ultrasound and 'membrane 
emutsification* devices (Becher, 1957; Dickinson, 1994). 

[0053] Aqueous microcapsules formed in water-in-oil emulsions are generally stable with little if any exchange of 
5 genetic elements or gene products between microcapsules. Additionally, we have demonstrated that several biochem- 
ical reactions proceed in emulsion microcapsules. Moreover, complicated biochemical processes, notably gene tran- 
scription and translation are also active in emulsion microcapsules. The technology exists to create emulsions with 
volumes all the way up to industrial scales of thousands of litres (Becher, 1 957; Sherman, 1 968; Lissant, 1 974; Lissant, 
1984). 

10 [0054] The preferred microcapsule size will vary depending upon the precise requirements of any individual selection 
process that is to be performed according to the present invention. In all cases, there will be an optimal balance between 
gene library size, the required enrichment and the required concentration of components in the individual microcapsules 
to achieve efficient expression and reactivity of the gene products. 

[0055] The processes of expression must occur within each individual microcapsule provided by the present inven- 
ts tion. Both in vitro transcription and coupled transcription-translation become less efficient at sub-nanomolar DNA con- 
centrations. Because of the requirement for only a limited number of DNA molecules to be present in each microcapsule, 
this therefore sets a practical upper limit on the possible microcapsule size. Preferably, the mean volume of the micro- 
capsules is less that 5.2 x 10' 16 m 3 , (corresponding to a spherical microcapsule of diameter less than 10pm, more 
preferably less than 6.5 x 10' 17 rn 3 (5prn), more preferably about 4.2 x 10 18 m 3 (2pm) and ideally about 9 x 10* 18 m 3 
20 (2.6um). 

[0056] The effective DNA or RN A concentration in the microcapsules may be artificially increased by various methods 
that will be well-known to those versed in the art. These include, for example, the addition of volume excluding chemicals 
such as polyethylene glycols (PEG) and a variety of gene amplification techniques, including transcription using RNA 
polymerases including those from bacteria such as E. coli (Roberts, 1 969; Blattner and Dahlberg, 1972; Roberts eta/., 

25 1975; Rosenberg etai , 1975) , eukaryotes e. g. (Weil era/. , 1979; Manley etai, 1983) and bacteriophage such as 
T7, T3 and SP6 (Melton et al., 1 984); the polymerase chain reaction (PCR) (Saiki era/., 1 988);Q|3 replicase amplification 
(Miele et ai, 1983; Cahill et al., 1991; Chetverin and Spirin, 1995; Katanaev era/., 1995); the ligase chain reaction 
(LCR) (Landegren etai., 1988; Barany, 1991); and self-sustained sequence replication system (Fahy etai, 1991) and 
strand displacement amplification (Walker etai, 1992). Even gene amplification techniques requiring thermal cycling 

30 such as PCR and LCR could be used if the emulsions and the in vitro transcription or coupled transcription-translation 
systems are thermostable (for example, the coupled transcription-translation systems could be made from a thermosta- 
ble organism such as Thermus aquaticus). 

[0057] Increasing the effective local nucleic acid concentration enables larger microcapsules to be used effectively. 
This allows a preferred practical upper limit to the microcapsule volume of about 5.2 x tO' 16 m 3 (corresponding to a 

35 sphere of diameter 10pm). 

[0058] The microcapsule size must be sufficiently large to accommodate all of the required components of the bio- 
chemical reactions that are needed to occur within the microcapsule. For example, in vitro, both transcription reactions 
and coupled transcription-translation reactions require a total nucleoside triphosphate concentration of about 2rnM. 
[0059] For example, in order to transcribe a gene to a single short RNA molecule of 500 bases in length, this would 

40 require a minimum of 500 molecules of nucleoside triphosphate per microcapsule (8.33 x 10 22 motes). In order to 
constitute a 2mM solution, this number of molecules must be contained within a microcapsule of volume 4.17 x 10 -19 
litres (4.1 7 x 10* 22 m 3 which if spherical would have a diameter of 93nm. 

[0060] Furthermore, particularly in the case of reactions involving translation, it is to be noted that the ribosomes 
necessary for the translation to occur are themselves approximately 20nm in diameter. Hence, the preferred lower limit 

45 for microcapsules is a diameter of approximately 0.1pm (100nm). 

[0061] Therefore, the microcapsule volume is preferably of the order of between 5.2 x 10 22 m 3 and 5.2 x 10' 16 m 3 
corresponding to a sphere of diameter between 0.1pm and 10p.m. more preferably of between about 5.2 x 10" 19 m 3 
and 6.5 x 10' 17 m 3 (1pm and 5pm). Sphere diameters of about 2.6pm are most advantageous. 
[0062] It is no coincidence that the preferred dimensions of the compartments (droplets of 2.6pm mean diameter) 

50 closely resemble those of bacteria, for example, Escherichia are 1 .1 -1 .5 x 2.0-6.0 pm rods and Azotobacter are 1 .5-2.0 
pm diameter ovoid cells. In its simplest form, Darwinian evolution is based on a 'one genotype one phenotype' mech- 
anism. The concentration of a single compartmentalised gene, or genome, drops from 0.4 nM in a compartment of 2 
pm diameter, to 25 pM in a compartment of 5 pm diameter. The prokaryotic transcriptionAranslation machinery has 
evolved to operate in compartments of — 1-2 pm diameter, where single genes are at approximately nanomolar con- 

55 centrations. A single gene, in a compartment of 2.6 pm diameter is at a concentration of 0.2 nM. This gene concentration 
is high enough for efficient translation. Compartmentalisation in such a volume also ensures that even if only a single 
molecule of the gene product is formed it is present at about 0.2 nM, which is important if the gene product is to have 
a modifying activity of the genetic element itself. The volume of the microcapsule should thus be selected bearing in 
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mind not only the requirements for transcription and translation of the genetic element, but also the modifying activity 
required of the gene product in the method of the invention. 

[0063J The size of emulsion microcapsules may be varied simply by tailoring the emulsion conditions used to form 
the emulsion according to requirements of the selection system. The. larger the microcapsule size, the larger is the 

5 volume that will be required to encapsulate a given genetic element library, since the ultimately limiting factor will be 
the size of the microcapsule and thus the number of microcapsules possible per unit volume. 
[0064J The size of the microcapsules is selected not only having regard to the requirements of the transcription/ 
translation system, but also those of the selection system employed for the genetic element. Thus, the components of 
the selection system, such as a chemical modification system, may require reaction volumes and/or reagent concen- 

10 trations which are not optimal for transcription/translation. As set forth herein, such requirements may be accommo- 
dated by a secondary re-encapsulation step; moreover, they may be accommodated by selecting the microcapsule 
size in order to maximise transcription/translation and selection as a whole. Empirical determination of optimal micro- 
capsule volume and reagent concentration, for example as set forth herein, is preferred. 

[0065] A "genetic element" in accordance with the present invention is as described above. Preferably, a genetic 
element is a molecule or construct selected from the group consisting of a DNA molecule, an RNA molecule, a partially 
or wholly artificial nucleic acid molecule consisting of exclusively synthetic or a mixture of naturally-occurring and 
synthetic bases, any one of the foregoing linked to a polypeptide, and any one of the foregoing linked to any other 
molecular group or construct. Advantageously, the other molecular group or construct may be selected from the group 
consisting of nucleic acids, polymeric substances, particularly beads, for example polystyrene beads, magnetic sub- 
20 stances such as magnetic beads, labels, such as fluorophores or isotopic labels, chemical reagents, binding agents 
such as macrocycles and the like. 

[0066] The nucleic acid portion of the genetic element may comprise suitable regulatory sequences, such as those 
required for efficient expression of the gene product, for example promoters, enhancers, translational initiation se- 
quences, polyadenylation sequences, splice sites and the like. 

25 [0067] As will be apparent from the following, in many cases the polypeptide or other molecular group or construct 
is a ligand or a substrate which directly or indirectly binds to or reacts with the gene product in order to tag the genetic 
element This allows the sorting of the genetic element on the basis of the activity of the gene product. 
[0068] The ligand or substrate can be connected to the nucleic acid by a variety of means that will be apparent to 
those skilled in the art (see, for example, Hermanson, 1 996). Any tag will suffice that allows for the subsequent selection 

30 of the genetic element. Sorting can be by any method which allows the preferential separation, amplification or survival 
of the tagged genetic element. Examples include selection by binding (including techniques based on magnetic sep- 
aration, for example using Dynabeads™), and by resistance to degradation (for example by nucleases, including re- 
striction endonucleases). 

[0069] One way in which the nucleic acid molecule may be linked to a ligand or substrate is through biotinylation. 
35 This can be done by PCR amplification with a 5'-biotinylation primer such that the biotin and nucleic acid are covaiently 
linked. 

[0070] The ligand or substrate to be selected can be attached to the modified nucleic acid by a variety of means that 
will be apparent to those of skill in the art. A biotinylated nucleic acid may be coupled to a polystyrene microbead (0.035 
to 0.2um in diameter) that is coated with avidin or streptavidin, that will therefore bind the nucleic acid with very high 
40 affinity. This bead can be derivatised with substrate or ligand by any suitable method such as by adding biotinylated 
substrate or by covalent coupling. 

[0071 ] Alternatively, a biotinylated nucleic acid may be coupled to avidin or streptavidin complexed to a large protein 
molecule such as thyroglobulin (669 Kd) or ferritin (440 Kd). This complex can be derivatised with substrate or ligand, 
for example by covalent coupling to the £-amino group of lysines or through a non-covalent interaction such as biotin- 

45 avidin. The substrate may be present in a form unlinked to the genetic element but containing an inactive "tag" that 
requires a further step to activate it such as photoactivation (e.g. of a "caged" biotin analogue, (Sundberg eta!., 1995; 
Pirrung and Huang, 1 996)). The catalyst to be selected then converts the substrate to product. The "tag" could then 
be activated and the "tagged" substrate and/or product bound by a tag-binding molecule (e.g. avidin or streptavidin) 
complexed with the nucleic acid. The ratio of substrate to product attached to the nucleic acid via the "tag" will therefore 

so reflect the ratio of the substrate and product in solution. 

[0072] An alternative is to couple the nucleic acid to a product-specific antibody (or other product-specific molecule). 
In this scenario, the substrate (or one of the substrates) is present in each microcapsule unlinked to the genetic element, 
but has a molecular "tag" (for example biotin, DIG or DNP). When the catalyst to be selected converts the substrate 
to product, the product retains the "tag" and is then captured in the microcapsule by the product-specific antibody. In 

55 this way the genetic element only becomes associated with the "tag" when it encodes or produces an enzyme capable 
of converting substrate to product. 

[0073] When all reactions are stopped and the microcapsules are combined, the genetic elements encoding active 
enzymes can be enriched using an antibody or other molecule which binds, or reacts specifically with the "tag". Although 
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both substrates and product have the molecular tag, only the genetic elements encoding active gene product will co- 
purify. 

[00741 The terms "isolating", "sorting" and "selecting", as well as variations thereof, are used herein. Isolation, ac- 
cording to the present invention, refers to the process of separating an entity from a heterogeneous population, for 

5 example a mixture, such that it is free of at least one substance with which it was associated before the isolation 
process. In a preferred embodiment, isolation refers to purification of an entity essentially to homogeneity. Sorting of 
an entity refers to the process of preferentially isolating desired entities over undesired entities. In as far as this relates 
to isolation of the desired entities, the terms "isolating" and "sorting" are equivalent. The method of the present invention 
permits the sorting of desired genetic elements from pools (libraries or repertoires) of genetic elements which contain 

10 the desired genetic element. Selecting is used to refer to the process (including the sorting process) of isolating an 
entity according to a particular property thereof. 

[0075J In a highly preferred application, the method of the present invention is useful for sorting libraries of genetic 
elements. The invention accordingly provides a method according to preceding aspects of the invention, wherein the 
genetic elements are isolated from a library of genetic elements encoding a repertoire of gene products. Herein, the 
'5 terms "library", "repertoire" and "poor are used according to their ordinary signification in the art, such that a library of 
genetic elements encodes a repertoire of gene products. In general, libraries are constructed from pools of genetic 
elements and have properties which facilitate sorting. 

{0076] Initial selection of a genetic element from a genetic element library using the present invention will in most 
cases require the screening of a large number of variant genetic elements. Libraries of genetic elements can be created 

20 in a variety of different ways, including the following. 

[0077] Pools of naturally occurring genetic elements can be cloned from genomic DNA or cDNA (Sambrook et ai, 
1 989) ; for example, phage antibody libraries, made by PCR amplification repertoires of antibody genes from immunised 
or unimmunised donors have proved very effective sources of functional antibody fragments (Winter etai, 1 994; Hoog- 
enboom, 1997). Libraries of genes can also be made by encoding all (see for example Smith, 1985; Parmley and 

25 Smith, 1988) or part of genes (see for example Lowman et a/., 1991 ) or pools of genes (see for example Nissim et ai, 
1994) by a randomised or doped synthetic oligonucleotide. Libraries can also be made by introducing mutations into 
a genetic element or pool of genetic elements 'randomly' by a variety of techniques in vivo, including; using 'mutator 
strains', of bacteria such as E coli mutD5 (Liao etal, 1 986; Yamagishi etal., 1990; Low etai, 1 996); using the antibody 
hypermutation system of B-lymphocytes (Yelamos et ai, 1 995). Random mutations can also be introduced both in vivo 

30 and in vitro by chemical mutagens, and ionising or UV irradiation (see Friedberg et ai, 1995), or incorporation of 
mutagenic base analogues (Freese, 1 959; Zaccolo et ai, 1 996). 'Random' mutations can also be introduced into genes 
in vitro during polymerisation for example by using error-prone polymerases (Leung etaL, 1989). 
[0078] Further diversification can be introduced by using homologous recombination either in vivo (see Kowalc- 
zykowski et ai, 1994 or in vitro (Stemmer, 1 994a; Stemmer, 1 994b)). 

35 [0079J According to a further aspect of the present invention, therefore, there is provided a method of in vitro evolution 
comprising the steps of: 



(a) selecting one or more genetic elements from a genetic element library according to the present invention; 

(b) mutating the selected genetic elements) in order to generate a further library of genetic elements encoding a 
repertoire to gene products; and 

(c) iteratively repeating steps (a) and (b) in order to obtain a gene product with enhanced activity. 
[0080] Mutations may be introduced into the genetic elements(s) as set forth above. 

[0081] The genetic elements according to the invention advantageously encode enzymes, preferably of pharmaco- 
logical or industrial interest, activators or inhibitors, especially of biological systems, such as cellular signal transduction 
mechanisms, antibodies and fragments thereof, other binding agents suitable for diagnostic and therapeutic applica- 
tions. In a preferred aspect, therefore, the invention permits the identification and isolation of clinically or industrially 
useful products. In a further aspect of the invention, there is provided a product when isolated by the method of the 
invention. 

[0082] The selection of suitable encapsulation conditions is desirable. Depending on the complexity and size of the 
library to be screened, it may be beneficial to set up the encapsulation procedure such that 1 or less than 1 genetic 
element is encapsulated per microcapsule. This will provide the greatest power of resolution. Where the library is larger 
and/or more complex, however, this may be impracticable; it may be preferable to encapsulate several genetic elements 
together and rely on repeated application of the method of the invention to achieve sorting of the desired activity. A 
combination of encapsulation procedures may be used to obtain the desired enrichment. 

[0083] Theoretical studies indicate that the larger the number of genetic element variants created the more likely it 
is that a molecule will be created with the properties desired (see Perelson and Oster, 1979 for a description of how 
this applies to repertoires of antibodies). Recently it has also been confirmed practically that larger phage-antibody 
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repertoires do indeed give rise lo more antibodies with better binding affinities than smaller repertoires (Griffiths etai, 
1 994). To ensure that rare variants are generated and thus are capable of being selected, a large library size is desirable. 
Thus, the use of optimally small microcapsules is beneficial. 

[0084] The largest repertoire created to date using methods that require an in vivo step (phage-display and Lad 
5 systems) has been a 1.6 x 10 11 clone phage-peptide library which required the fermentation of 15 litres of bacteria 
(Fisch et at., 1996). SELEX experiments are often carried out on very large numbers of variants (up to 10 15 ). 
[0085] Using the present invention, at a preferred microcapsule diameter of 2.6u.m, a repertoire size of at least 1 0 11 
can be selected using 1ml aqueous phase in a 20 ml emulsion. 

[0086] In addition to the genetic elements described above, the microcapsules according to the invention will com- 

10 prise further components required for the sorting process to take place . Other components of the system will for example 
comprise those necessary for transcription and/or translation of the genetic element. These are selected for the re- 
quirements of a specific system from the following; a suitable buffer, an in vitro transcription/replication system and/or 
- an in vitro translation system containing all the necessary ingredients, enzymes and cofactors, RNA polymerase, nu- 
cleotides, nucleic acids (natural or synthetic), transfer RNAs, ribosomes and amino acids, and the substrates of the 

15 reaction of interest in order to allow selection of the modified gene product. 

[0087] A suitable buffer will be one in which all of the desired components of the biological system are active and 
will therefore depend upon the requirements of each specific reaction system. Buffers suitable for biological and/or 
i chemical reactions are known in the art and recipes provided in various laboratory texts, such as Sambrook etal, 1989. 
[0088] The in vitro translation system will usually comprise a cell extract, typically from bacteria (Zubay, 1 973; Zubay, 

20 1 980; Lesley et al, 1 991 ; Lesley, 1 995), rabbit reticulocytes (Pelham and Jackson, 1 976), or wheat germ (Anderson 
et a/., 1983). Many suitable systems are commercially available (for example from Promega) including some which 
will allow coupled transcription/translation (all the bacterial systems and the reticulocyte and wheat germ TNT™ extract 
systems from Promega). The mixture of amino acids used may include synthetic amino acids if desired, to increase 
the possible number or variety of proteins produced in the library. This can be accomplished by charging tRNAs with 

25 artificial amino acids and using these tRNAs for the in vitro translation of the proteins to be selected (EHman et aL, 
1991; Benner, 1994; Mendel et al., 1995). 

[0089] After each round of selection the enrichment of the pool of genetic elements for those encoding the molecules 
of interest can be assayed by non-compartmentalised in vitro transcription/replication or coupled transcription -trans- 
lation reactions. The selected pool is cloned into a suitable plasmid vector and RNA or recombinant protein is produced 

30 from the individual clones for further purification and assay. 

[0090] The invention moreover relates to a method for producing a gene product, once a genetic element encoding 
the gene product has been sorted by the method of the invention. Clearly, the genetic element itself may be directly 
expressed by conventional means to produce the gene product. However, alternative techniques may be employed, 
as will be apparent to those skilled in the art. For example, the genetic information incorporated in the gene product 

35 may be incorporated into a suitable expression vector, and expressed therefrom. 

[0091] The invention also describes the use of conventional screening techniques to identify compounds which are 
capable of interacting with the gene products identified by the first aspect of the invention. In preferred embodiments, 
gene product encoding nucleic acid is incorporated into a vector, and introduced into suitable host cells to produce 
transformed cell lines that express the gene product. The resulting cell lines can then be produced for reproducible 

40 qualitative and/or quantitative analysis of the effect (s) of potential drugs affecting gene product function. Thus gene 
. product expressing cells may be employed for the identification of compounds, particularly small molecular weight 
compounds, which modulate the function of gene product. Thus host cells expressing gene product are useful for drug 
screening and it is a further object of the present invention to provide a method for identifying compounds which mod- 
ulate the activity of the gene product, said method comprising exposing cells containing heterologous DNA encoding 

4 5 - gene product, wherein said cells produce functional gene product, to at least one compound or mixture of compounds 
or signal whose ability to modulate the activity of said gene product is sought to be determined, and thereafter monitoring 
said cells for changes caused by said modulation. Such an assay enables the identification of modulators, such as 
agonists, antagonists and allosteric modulators, of the gene product. As used herein, a compound or signal that mod- 
ulates the activity of gene product refers to a compound that alters the activity of gene product in such a way that the 

50 activity of gene product is different in the presence of the compound or signal (as compared to the absence of said 
compound or signal). 

[0092] Cell-based screening assays can be designed by constructing cell lines in which the expression of a reporter 
protein, i.e. an easily assayable protein, such as b galactosidase, chloramphenicol acetyltransferase (CAT) or luci- 
ferase, is dependent on gene product. Such an assay enables the detection of compounds that directly modulate gene 
55 product function, such as compounds that antagonise gene product, or compounds that inhibit or potentiate other 
cellular functions required for the activity of gene product. 

[0093] The present invention also provides a method to exogenously affect gene product dependent processes oc- 
curring in cells. Recombinant gene product producing host cells, e.g. mammalian cells, can be contacted with a test 
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compound, and the modulating effect(s) thereof can then be evaluated by comparing the gene product-mediated re- 
sponse in the presence and absence of test compound, or relating the gene product-mediated response of test cells, 
or control cells (i.e... cells that do not express gene product), to the presence of the compound. 
[0094] In a further aspect, the invention relates to a method for optimising a production process which involves at 
5 least one step which is facilitated by a polypeptide. For example, the step may be a catalytic step, which is facilitated 
by an enzyme. Thus, the invention provides a method for preparing a compound or compounds comprising the steps of: 

(a) providing a synthesis protocol wherein at least one step is facilitated by a polypeptide; 

(b) preparing genetic elements encoding variants of the polypeptide which facilitates this step; 
io (c) compartmentalising the genetic elements into microcapsules; 

(d) expressing the genetic elements to produce their respective gene products within the microcapsules; 

(e) sorting the genetic elements which produce polypeptide gene product(s) having the desired activity; and 

(0 preparing the compound or compounds using the polypeptide gene product identified in (e) to facilitate the 
relevant step of the synthesis. 

15 

[0095] By means of the invention, enzymes involved in the preparation of a compound may be optimised by selection 
for optimal activity. The procedure involves the preparation of variants of the polypeptide to be screened, which equate 
to a library of polypeptides as refereed to herein. The variants may be prepared in the same manner as the libraries 
discussed elsewhere herein. 

20 

(B) SELECTION PROCEDURES 

[0096] The system can be configured to select for RNA, DNA or protein gene product molecules with catalytic, reg- 
ulatory or binding activity. 

25 

(i) AFFINITY SELECTION 

[0097] In the case of selection for a gene product with affinity for a specific ligand the genetic element may be linked 
to the gene product in the microcapsule via the ligand. Only gene products with affinity for the ligand will therefore bind 
30 to the genetic element itself and therefore only genetic elements that produce active product will be retained in the 
selection step. In this embodiment, the genetic element will thus comprise a nucleic acid encoding the gene product 
linked to a ligand for the gene product. 

[0098] In this embodiment, all the gene products to be selected contain a putative binding domain, which is to be 
selected for, and a common feature - a tag. The genetic element in each microcapsule is physically linked to the ligand. 

35 (f the gene product produced from the genetic element has affinity for the ligand, it will bind to it and become physically 
linked to the same genetic element that encoded it, resulting in the genetic element being 'tagged'. At the end of the 
reaction, all of the microcapsules are combined, and all genetic elements and gene products pooled together in one 
environment. Genetic elements encoding gene products exhibiting the desired binding can be selected by affinity pu- 
rification using a molecule that specifically binds to, or reacts specifically with, the "tag". 

40 [0099] In an alternative embodiment, genetic elements may be sorted on the basis that the gene product, which 
binds to the ligand, merely hides the ligand from, for example, further binding partners. In this eventuality, the genetic 
element, rather than being retained during an affinity purification step, may be selectively eluted whilst other genetic 
elements are bound. 

[0100] In an alternative embodiment, the invention provides a method according to the first aspect of the invention, 
45 wherein in step (b) the gene products bind to genetic elements encoding them. The gene products together with the 
attached genetic elements are then sorted as a result of binding of a ligand to gene products having the desired activity. 
For example, all gene products can contain an invariant region which binds covalently or non-covalently to the genetic 
element, and a second region which is diversified so as to generate the desired binding activity. 
[0101] Sorting by affinity is dependent on the presence of two members of a binding pair in such conditions that 
so binding may occur. Any binding pair may be used for this purpose. As used herein, the term binding pair refers to any 
pair of molecules capable of binding to one another. Examples of binding pairs that may be used in the present invention 
include an antigen and an antibody or fragment thereof capable of binding the antigen, the biotin-avidin/streptavidin 
pair (Savage et at., 1994), a calcium-dependent binding polypeptide and ligand thereof (e.g. calmodulin and a calmod- 
ulin-binding peptide (Stofko et at., 1992; Montigiani et ai ,1996)), pairs of polypeptides which assemble to form a 
55 leucine zipper (Tripet et al., 1996), histidines (typically hexahistidine peptides) and chelated Cu 2+ , 2n 2+ and Ni 2 \ (e. 
g. Ni-NTA; Hochuli et al., 1987), RNA-binding and DNA-binding proteins (Klug, 1995) including those containing zinc- 
finger motifs (Klug and Schwabe : 1995) and DNA methyltransferases (Anderson, 1993), and their nucleic acid binding 
sites. 
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(if) CATALYSIS 

[0102] When selection is for catalysis, the genetic element in each microcapsule may comprise the substrate of the 
reaction. If the genetic element encodes a gene product capable of acting as a catalyst, the gene product will catalyse 

5 the conversion of the substrate into the product. Therefore, at the end of the reaction the genetic element is physically 
linked to the product of the catalysed reaction . When the microcapsules are combined and the reactants pooled, genetic 
elements encoding catalytic molecules can be enriched by selecting for any property specific to the product (Figure 1 ). 
[0103] For example, enrichment can be by affinity purification using a molecule (e.g. an antibody) that binds specif- 
ically to the product. Equally, the gene product may have the effect of modifying a nucleic acid component of the genetic 

10 element, for example by methylation (or demethylation) or mutation of the nucleic acid, rendering it resistant to or 
susceptible to attack by nucleases, such as restriction endonucleases. 

[0104] Alternatively, selection may be performed indirectly by coupling a first reaction to subsequent reactions that 
lakes place in the same microcapsule. There are two general ways in which this may be performed. First, the product 
of the first reaction could be reacted with, or bound by, a molecule which does not react with the substrate of the first 

15 reaction. A second, coupled reaction will only proceed in the presence of the product of the first reaction. An active 
genetic element can then be purified by selection for the properties of the product of the second reaction. 
[01 05] Alternatively, the product of the reaction being selected may be the substrate or cof actor for a second enzyme- 
catalysed reaction. The enzyme to catalyse the second reaction can either be translated in situ in the microcapsules 
or incorporated in the reaction mixture prior to microencapsulation. Only when the first reaction proceeds will the cou- 

20 pled enzyme generate a selectable product. 

[0106] This concept of coupling can be elaborated to incorporate multiple enzymes, each using as a substrate the 
product of the previous reaction. This allows for selection of enzymes that will not react with an immobilised substrate. 
It can also be designed to give increased sensitivity by signal amplification if a product of one reaction is a catalyst or 
a cofactor for a second reaction or series of reactions leading to a selectable product ( for example, see Johannsson 

25 and Bates, 1988; Johannsson, 1991). Furthermore an enzyme cascade system can be based on the production of an 
activator for an enzyme or the destruction of an enzyme inhibitor (see Mize et si, 1989). Coupling also has the advan- 
tage that a common selection system can be used for a whole group of enzymes which generate the same product 
and allows for the selection of complicated chemical transformations that cannot be performed in a single step. 
[0107] Such a method of coupling thus enables the evolution of novel "metabolic pathways" in vitro in a stepwise 

30 fashion, selecting and improving first one step and then the next. The selection strategy is based on the final product 
of the pathway, so that all earlier steps can be evolved independently or sequentially without setting up a new selection 
system for each step of the reaction. 

[0108] Expressed in an alternative manner, there is provided a method of isolating one or more genetic elements 
encoding a gene product having a desired catalytic activity, comprising the steps of: 

35 

(1) expressing genetic elements to give their respective gene products; 

(2) allowing the gene products to catalyse conversion of a substrate to a product, which may or may not be directly 
selectable, in accordance with the desired activity; 

(3) optionally coupling the first reaction to one or more subsequent reactions, each reaction being modulated by 
40 the product of the previous reactions, and leading to the creation of a final, selectable product; 

(4) linking the selectable product of catalysis to the genetic elements by either; 

a) coupling a substrate to the genetic elements in such a way that the product remains associated with the 
genetic elements, or 

45 *>) reacting or binding the selectable product to the genetic elements by way of a suitable molecular "tag" 

attached to the substrate which remains on the product, or 

c) coupling the selectable product (but not the substrate) to the genetic elements by means of a product- 
specific reaction or interaction with the product; and 

50 (5) selecting the product of catalysis, together with the genetic element to which it is bound, either by means of a 

specific reaction or interaction with the product, or by affinity purification using a suitable molecular "tag" attached 
to the product of catalysis, wherein steps (1 ) to (4) each genetic element and respective gene product is contained 
within a microcapsule. 

55 (jjj) REGULATION 

[01 09] A similar system can be used to select for regulatory properties of enzymes. 

[01 10] In the case of selection for a regu lator molecule which acts as an activator or inhibitor of a biochemical process. 
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the components of the biochemical process can either be translated in situ in each microcapsule or can be incorporated 
in the reaction mixture prior to microencapsulation. If the genetic element being selected is to encode an activator, 
selection can be performed for the product of the regulated reaction, as described above in connection with catalysis. 
If an inhibitor is desired, selection can be for a chemical property specific to the substrate of the regulated reaction. 
5 [0111] There is therefore provided a method of sorting one or more genetic elements coding for a gene product 
exhibiting a desired regulatory activity, comprising the steps of: 

(1 ) expressing genetic elements to give their respective gene products; 

(2) allowing the gene products to activate or inhibit a biochemical reaction, or sequence of coupled reactions, in 
10 accordance with the desired activity, in such a way as to allow the generation or survival of a selectable molecule; 

(3) linking the selectable molecule to the genetic elements either by 

a) having the selectable molecule, or the substrate from which it derives, attached to the genetic elements, or 

b) reacting or binding the selectable product to the genetic elements, by way of a suitable molecular "tag" 
'5 attached to the substrate which remains on the product, or 

c) coupling the product of catalysis {but not the substrate) to the genetic elements, by means of a product- 
specific reaction or interaction with the product; 

(4) selecting the selectable product, together with the genetic element to which it is bound, either by means of a 
20 specific reaction or interaction with the selectable product, or by affinity purification using a suitable molecular "tag" 

attached to the product of catalysis. 

wherein steps (1) to (4) each genetic element and respective gene product is contained within a microcapsule. 

25 (iv) MICROCAPSULE SORTING 

[01 1 2] The invention provides for the sorting of intact microcapsules where this is enabled by the sorting techniques 
being employed. Microcapsules may be sorted as such when the change induced by the desired gene product either 
occurs or manifests itself at the surface of the microcapsule or is detectable from outside the microcapsule. The change 

30 may be caused by the direct action of the gene product, or indirect, in which a series of reactions, one or more of which 
involve the gene product having the desired activity leads to the change. For example, the microcapsule may be so 
configured that the gene product is displayed at its surface and thus accessible to reagents. Where the microcapsule 
is a membranous microcapsule, the gene product may be targeted or may cause the targeting of a molecule to the 
membrane of the microcapsule. This can be achieved, for example, by employing a membrane localisation sequence, 

35 such as those derived from membrane proteins, which will favour the incorporation of a fused or linked molecule into 
the microcapsule membrane. Alternatively, where the microcapsule is formed by phase partitioning such as with water- 
in-oil emulsions, a molecule having parts which are more soluble in the extra -capsular phase will arrange themselves 
such that they are present at the boundary of the microcapsule. 

[01 13] In a preferred aspect of the invention, however, microcapsule sorting is applied to sorting systems which rely 
40 on a change in the optical properties of the microcapsule : for example absorption or emission characteristics thereof, 
for example alteration in the optical properties of the microcapsule resulting from a reaction leading to changes in 
absorbance, luminescence, phosphorescence or fluorescence associated with the microcapsule. All such properties 
are included in the term "opticar. In such a case, microcapsules can be sorted by luminescence, fluorescence or 
phosphorescence activated sorting. In a highly preferred embodiment, fluorescence activated sorting is employed to 
45 sort microcapsules in which the production of a gene product having a desired activity is accompanied by the production 
of a fluorescent molecule in the cell. For example, the gene product itself may be fluorescent, for example a fluorescent 
protein such as GFP. Alternatively, the gene product may induce or modify the fluorescence of another molecule, such 
as by binding to it or reacting with it. 

so ( V ) MICROCAPSULE IDENTIFICATION 

[0114] Microcapsules may be identified by virtue of a change induced by the desired gene product which either 
occurs or manifests itself at the surface of the microcapsule or is detectable from the outside as described in section 
iii (Microcapsule Sorting). This change, when identified, is used to trigger the modification of the gene within the com- 
55 partment. In a preferred aspect of the invention, microcapsule identification relies on a change in the optical properties 
of the microcapsule resulting from a reaction leading to luminescence, phosphorescence or fluorescence within the 
microcapsule. Modification of the gene within the microcapsules would be triggered by identification of luminescence, 
phosphorescence or fluorescence. For example, identification of luminescence, phosphorescence or fluorescence can 
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trigger bombardment of the compartment with photons (or other particles or waves) which leads to modification of the 
genetic element. A similar procedure has been described previously for the rapid sorting of cells (Keij et at., 1994). 
Modification of the genetic element may result, for example, from coupling a molecular "tag", caged by a photolabile 
protecting group to the genetic elements: bombardment with photons of an appropriate wavelength leads to the removal 
5 of the cage. Afterwards, all microcapsules are combined and the genetic elements pooled together in one environment. 
Genetic elements encoding gene products exhibiting the desired activity can be selected by affinity purification using 
a molecule that specifically binds to, or reacts specifically with, the "tag". 

(vi) MULTI-STEP PROCEDURE 

10 

[0115] It will be also be appreciated that according to the present invention, it is not necessary for all the processes 
of transcription/replication and/or translation, and selection to proceed in one single step, with all reactions taking place 
in one microcapsule. The selection procedure may comprise two or more steps. First, transcription/replication and/or 
translation of each genetic element of a genetic element library may take place in a first microcapsule. Each gene 

15 product is then linked to the genetic element which encoded it (which resides in the same microcapsule). The micro- 
capsules are then broken, and the genetic elements attached to their respective gene products optionally purified. 
Alternatively, genetic elements can be attached to their respective gene products using methods which do not rely on 
encapsulation. For example phage display (Smith, G.P.,1 985), polysome display (Mattheakkis et at., 1 994) r RNA-pep- 
tide fusion (Roberts and Szostak, 1 997) or lac repressor peptide fusion (Cull, et at, 1992). 

20 [0116] In the second step of the procedure, each purified genetic element attached to its gene product is put into a 
second microcapsule containing components of the reaction to be selected. This reaction is then initiated. After com- 
pletion of the reactions, the microcapsules are again broken and the modified genetic elements are selected. In the 
case of complicated multistep reactions in which many individual components and reaction steps are involved, one or 
more intervening steps may be performed between the initial step of creation and linking of gene product to genetic 

25 element, and the final step of generating the selectable change in the genetic element. 

(vii) SELECTION BY ACTIVATION OF REPORTER GENE EXPRESSION IN SITU 

[0117] The system can be configured such that the desired binding, catalytic or regulatory activity encoded by a 
30 genetic element leads, directly or indirectly to the activation of expression of a "reporter gene" that is present in all 
microcapsules. Only gene products with the desired activity activate expression of the reporter gene. The activity 
resulting from reporter gene expression allows the selection of the genetic element (or of the compartment containing 
it) by any of the methods described herein. 

[0118] For example, activation of the reporter gene may be the result of a binding activity of the gene product in a 
35 manner analogous to the "two hybrid system" (Fields and Song, 1989). Activation might also result from the product 
of a reaction catalysed by a desirable gene product. For example, the reaction product could be a transcriptional inducer 
of the reporter gene. For example arabinose could be used to induce transcription from the araBAD promoter. The 
activity of the desirable gene product could also result in the modification of a transcription factor resulting in expression 
of the reporter gene. For example, if the desired gene product is a kinase or phosphatase the phosphorylation or 
40 dephosphorylation of a transcription factor may lead to activation of reporter gene expression. 

(viii) AMPLIFICATION 

[01 19] According to a further aspect of the present invention the method comprises the further step of amplifying the 
45 genetic elements. Selective amplification may be used as a means to enrich for genetic elements encoding the desired 
gene product. 

[0120] In all the above configurations, genetic material comprised in the genetic elements may be amplified and the 
process repeated in iterative steps. Amplification may be by the polymerase chain reaction (Saiki et al, 1988) or by 
using one of a variety of other gene amplification techniques including; Qp replicase amplification (Cahill, Foster and 
50 Mahan, 1991; Chetverin and Spirin, 1995; Katanaev, Kumasov and Spirin, 1995); the ligase chain reaction (LCR) 
(Landegren et at, 1988; Barany, 1991); the self-sustained sequence replication system (Fahy, Kwoh and Gingeras, 
1 991 ) and strand displacement amplification (Walker et at., 1 992). 

(ix) COMPARTMENTALISATION 

55 

[0121] According to a further aspect of the present invention, there is provided a method for compartmentalising a 
genetic element and expressing the genetic element to form its gene product within the compartment, comprising the 
steps of: 
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(a) forming an aqueous solution comprising the genetic element and the components necessary to express it to 
form its gene product; 

(b) microencapsulating the solution so as to form a discrete microcapsule comprising the genetic element; and 

(c) exposing the microcapsule to conditions suitable for the expression of the genetic element to form its gene 
5 product to proceed. 

[0122] Suitable microencapsulation techniques are described in detail in the foregoing general description. 
[01 23] Preferably, a library of genetic elements encoding a repertoire of gene products is encapsulated by the method 
set forth above, and the genetic elements expressed to produce their respective gene products, in accordance with 
to the invention. In a highly preferred embodiment, microencapsulation is achieved by forming a water-in-oil emulsion of 
the aqueous solution comprising the genetic elements. 

[0124] The invention, accordingly, also provides a microcapsule obtainable by the method set forth above. 
[01 25] Various aspects and embodiments of the present invention are illustrated in the following examples. It will be 
appreciated that modification of detail may be made without departing from the scope of the invention. 
'5 [0126] All documents mentioned in the text are incorporated by reference. 

EXAMPLES 

Example 1. 

20 

The production of approx. 2u.m aqueous microcapsules in a water-in-oil emulsion system. 

[0127] Microcapsules within the preferred size range of the present invention can be generated using a water-in-oil 
emulsion system. 

25 [01 28] Light white mineral oil (Sigma; M-351 6) is used herein as the continuous phase and the emulsion is stabilised 
by emulsifiers sorbitan monooleate (Span 80, Ruka; 85548) and polyoxyethylenesorbitan monooleate (Tween 80, 
Sigma Ultra; P-8074) and in some cases also with 0.5% w/v sodium deoxycholate (Ruka; 30970). 
[0129] The oil phase is freshly prepared by dissolving 4.5% (v/v) Span 80 (Fluka) in mineral oil (Sigma, #M-5904) 
followed by 0.5% (v/v) Tween 80 (SigmaUltra; #P-8074). Ice-cooled in vitro reaction mixtures (50 ut) are added gradually 

30 (in 5 aliquots of 1 0 uJ over -2 minutes) to 0.95 ml of ice-cooled oil-phase in a 5 ml Costar Biofreeze Vial (#2051 ) whilst 
stirring with a magnetic bar (8x3 mm with a pivot ring; Scientific Industries International, Loughborough, UK). Stirring 
(at 1150 rpm) is continued for an additional 1 minute on ice. In some emulsions the aqueous phase is supplemented 
with an anionic surfactant - e.g., sodium deoxycholate, sodium cholate, sodium glycochoiate, and sodium taurocholate, 
typically to 0.5% (w/v). 

35 [0130] When indicated, the emulsion is further homogenised using an Ultra-Turrax T25 disperser (IKA) equipped 
with an 8 mm diameter dispersing tool at 8k, 9k or 13.5k rpm for 1 minute, or at 20k rpm for t or 5 minutes, on ice. 
This reduces the microcapsule size. 

[01 31 ] The reactions may be quenched and the emulsion broken as indicated in individual examples, by spinning at 
3,000 g lor 5 minutes and removing the oil phase, leaving the concentrated emulsion at the bottom of the vial. Quenching 
40 buffer (typically, 0.2 ml of 25 ng/ml yeast RNA in W+B buffer: 1 M NaCI, 10 mM Tris-HCI, 1 mM EDTA pH 7.4) and 2 
ml of water-saturated diethyl ether is added and the mixture vortexed, centrifuged briefly, and the ether phase removed. 
The aqueous phase is washed with ether and dried (5 minutes in a Speedvac at ambient temperature). 
[0132] The size distribution of the aqueous droplets in the emulsions was determined by laser diffraction using a 
Coulter LS230 Particle Size Analyser. An aliquot of emulsion, freshly diluted (1:10) in mineral oil is added to the micro- 
ns volume chamber containing stirred mineral oil. Results are analysed with the instrument's built-in Mie optical model 
using refractive indices of 1 .468 for mineral oil and 1 .350 for the aqueous phase. The size distribution of the aqueous 
droplets in the emulsion is shown in Rgure 2. Addition of sodium deoxycholate does not significantly alter the size 
distribution. 

50 Example 2. 

Efficient in vitro transcription reactions performed in the aqueous microcapsules of a water-in-oil emulsion. 

[0133] In order to produce RNA from DNA within each microcapsule, the single molecule of DNA present within each 
55 aqueous microcapsule of the system must be transcribed efficiently. Herein, in vitro transcription is demonstrated within 
microcapsules. 

[0134] The catalytic core of the Tetrahymena self-splicing intron is a much-studied ribozyme which can catalyse a 
variety of phosphoestertransfer reactions (Sag eta!., 1986; Sag and Czech, 1986; Sag and Czech, 1986). For example, 
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a modified Tetrahymena intron missing the P1 stem-loop from the S'-end, and missing the 3' stem-loops P9.1 and P9.2 
can function as an RNA ligase, efficiently splicing together two or more oligonucleotides aligned on a template strand 
(Green and Szostak. 1992). 

[0135] DNA encoding the above-described Tetrahymena ribozyme is PCR-amplified using primers P2T7Ba (which 
5 anneals to the P2 loop region and appends a T7 RNA polymerase promoter) and P9Fo (which anneals to the P9 loop 
region). This creates a 331 base pair DNA fragment carrying the T7 RNA polymerase promoter. This fragment is purified 
directly using Wizard PCR Preps (Promega) and used as the template for an in vitro transcription reaction using T7 
RNA polymerase. 

[0136J In vitro transcription is assayed over an initial 10 minute period during which the reaction rate is essentially 
10 linear (Chamberlin and Ring, 1 973). Reaction conditions for transcription are as described by Wyatt et ai, 1 991 
[0137] Incorporation of [f 32 P] UTP is used to assay the progression of the reaction. 

[0138] A transcription reaction is set up in a volume of 200uJ and divided into 2 aliquots, each containing 3 x 10" 
molecules of DNA (5nM). One 1 OOuJ aliquot is added to 2ml Sigma light mineral oil containing 4.5% Span 80 and 0.5% 
Tween 80 and homogenised for 5 minutes with an Ultra-Turrax T25 disperser at 20,000 rpm as in Example 1. Based 
15 on the mean microcapsule volume in these emulsions (2.8 X 1 O^m* for a 0.81 u.m diameter microcapsule) the 100uJ 
reaction would be divided into 3.6 x 10 11 microcapsules. Hence, there should be 1 molecule of DNA per microcapsule 
on average. 

[0139] Both aliquots are incubated in a 37°C water bath. 0.5 ml samples of the emulsion are removed both before 
the start of the incubation and after 10 minutes and placed on ice. Similar 25uJ samples are removed from the non- 
20 emulsified control reactions at the same time. Emulsions are broken and reactions stopped with 0.5 ml EDTA (50 mM) 
and 2 ml water-saturated diethyl ether as described in Example 1. 100u.i salmon sperm DNA (500u,g/ml) in 20 mM 
EDTA is then added. Three 100^1 aliquots are then removed from both emulsions and controls and labelled RNA is 
assayed by TCA precipitation and scintillation counting. 

[0140J The rate of transcription is taken as the increase in acid perceptible cpm over the 10 minute incubation at 
25 37°C. In the non emulsified control reaction there are 442,000 cpm acid perceptible material compared to 147,000 
cpm in the emulsion. Hence the rate of transcription in the emulsion is 33% of that found in the non-emulsif ied control 
reaction. 

[01 41 ] This procedure therefore shows that RNA can be efficiently synthesised by T7 RNA polymerase in the aqueous 
microcapsules of a water-in-oil emulsion. 

30 

Example 3. 

Efficient coupled in vitro transcription/translation reactions performed in the aqueous microcapsules of a 
water-in-oil emulsion. 

35 

[01 42] I n order to synthesise proteins using the procedure of the present invention, translation must be active in the 
aqueous microcapsules of the water-in-oil emulsion described herein. 

[0143] Here it is shown how a protein (E. coli dihydrofolate reductase) can be efficiently produced from DNA in the 
aqueous microcapsules of a water-in-oil emulsion system using a coupled transcription/translation system. 
[01 44] The £. coli folk gene encoding dihydrofolate reductase (DHFR) is PCR-amplified using oligonucleotides ED- 
HFRFo and EDHFRBa. This DNA is then cloned into the pGEM-42 vector (Promega) digested with HindlW and Kpn\ 
downstream of the both the lac promoter and the T7 RNA polymerase promoter. The oligonucleotide EDHFRBa ap- 
pends the efficient phage T7 gene 10 translational start site upstream of the DHFR start codon. 
[0145] DNA sequencing identifies a clone which has the correct nucleotide sequence. Bacteria transformed with this 
clone (pGEM to/A) are found to over express active DHFR (driven from the lac promoter) when induced with IPTG. 
[01 46] The pGEM to/A plasmid is then PCR-amplified using primers LMB2 and LMB3 under the conditions described 
above to create a 649bp DNA fragment carrying the T7 RNA polymerase promoter, the phage T7 gene 1 0 translational 
start site and the to/A gene. This PCR fragment is purified directly using Wizard PCR Preps (Promega) and used to 
program a prokaryotic in vitro coupled transcription/translation system designed for linear templates (Lesley Brow and 
50 Burgess, 1991). 

[0147] A commercial preparation of this system is used (E. coli S30 Extract System for Linear Templates; Promega) 
supplemented with T7 RNA polymerase. 

[0148] A 300uJ translation reaction is set up on ice containing 3 x 10 12 molecules of DNA. T7 RNA polymerase (10 4 
units) is added to drive transcription and the translated protein is labelled by the addition of p 5 S] methionine. A 150u.l 
55 aliquot of this reaction is added to 2.85 ml Sigma light mineral oil containing 4.5% Span 80 and 0.5% Tween 80 and 
homogenised for 1 minute with an Ultra-Turrax T25 disperser at 20,000 rpm, as in Example 1 . The other aliquot is not 
emulsified. 

[01 49] Based on the mean microcapsule volume in the emulsions ( 1 .1 x 1 0"^ m 3 for a ^ 2 9jim diameter microcapsule) 
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the 150uf reaction would be divided into 1.3 x 10 11 , microcapsules). Hence, there should be roughly 11 molecules of 
DNA per microcapsule. 

[0150] Four 0.5ml aliquots are removed from the emulsion reaction mix. One aliquot is immediately put on ice and 
the other three are incubated in a 25°C water bath for 2 hours before being placed on ice. Four 25uJ samples are also 
5 removed from the non-emulsified reaction mix; one is put immediately on ice and the other three are incubated in a 
25°C water bath for 2 hours and then placed on ice. 

[0151] The emulsions are spun in a microfuge at 13,000 r.p.m. for 5 minutes at 4°C and the mineral oil removed 
leaving the concentrated (but still intact) emulsion at the bottom of the tube. After briefly re-spinning and removing any 
further mineral oil, the emulsion is broken and any further translation stopped by adding 100uJ water containing 1 25u.g/ 
10 ml puromycin, and 1 ml water saturated diethyl ether. This mixture is vortexed and respun in a microfuge at 13,000 r. 
p.m. for 1 minute at 4°C. The ether and dissolved mineral oil is then removed by aspiration and the extraction repeated 
with a further 1 ml of ether. Any remaining ether is driven off by spinning for 5 minutes in a Speedvac at room temper- 
ature. 

[0152] 1 0OpJ water containing 1 25u.g/ml puromycin is also added to the 25uJ non-emulsified control reactions. 25u.l 
15 of each of the samples is then precipitated with acetone and run on a 20% SDS-PAGE gel according to the instructions 
given by the manufacturers of the in vitro transcription/translation system (Promega). The gel is dried and scanned 
using a Phosphorlmager (Molecular Dynamics). A single strong band is seen with the expected molecular weight of 
DHFR (18 kd) in both the reactions performed in emulsions and in the controls. This band is accurately quantified. 
[01 53] In the emulsified reactions the mean area under the 1 8kd peak is 1 5,073 units whereas the mean area under 
20 the same peak in the non-emulsified control reactions is 18,990 units. Hence, in the emulsified reactions the amount 
of DHFR protein is calculated to be 79% that found in the nonemulsif ied control reactions. This therefore indicates that 
the transcription/translation system is functional in the water-in-oil emulsion system of the present invention. 

Example 4. 

25 

Din yd ro folate reductase produced using the coupled in vitro transcription/translation reactions Is active. 

[0154] Here it is shown that protein (£T. coli dihydrofolate reductase) can be produced efficiently in a catalytically 
active form by coupled transcription/translation of the fo/A gene in the aqueous microcapsules of a water-in-oil emulsion 
system. In this assay, an emulsion comprising microcapsules below optimal size is used; DHFR activity is shown to 
be higher in the larger microcapsule sizes. 

[0155] 175uJ translation reactions (unlabelled) are set up on ice containing either 2 x 10 11 , 6 x 10 12 or 1.8 x 10 12 
molecules of the folA template DNA used in Example 3, or no DNA. T7 RNA polymerase (6x1 0 3 units) are added to 
each reaction to drive transcription. 

[0156] A 1 0OuJ aliquot of each reaction is added to 1 .9ml Sigma light mineral oil containing 4.5% Span 80 and 0.5% 
Tween 80 and homogenised for 1 minute or 5 minutes with an Ultra-Turrax T25 Homogeniser equipped with an 8mm 
diameter dispersing tool, at 20,000 rpm as in Example 1. After homogenisation for 1 minute the mean diameter of 
particles (by volume) is 1 .30um (median 1 .28)u.m). 98% by volume of the internal (aqueous) phase is present in particles 
varying from 0.63um to 2.12u.m. After homogenisation for 5 minutes the mean diameter of microcapsules (by volume) 
is 0.81 u.m (median 0.79um) and 98% by volume of the internal (aqueous) phase is present in particles varying from 
0.41umto1.38u.M. 

[01 57] Based on the mean microcapsule volume in the 1 minute emulsions (1.1 x 1 0~ 18 m 3 for a 1 .299 u.m diameter 
microcapsule) the 1 OOuJ reaction would be divided into 8.7 x to 10 microcapsules). Hence, there should be roughly 1 .3, 
3.9 or 11 .8 molecules of DNA per microcapsule. 

[0158] Based on the mean microcapsule volume in the 5 min emulsions (2.8 x 10 19 M 3 for a 0.81 \xm diameter 
microcapsule) the 1 0Oiil reaction would be divided into 3.6 x 10 11 microcapsules). Hence, there should be roughly 0.3, 
1 .0 or 2.9 molecules of DNA per microcapsule. 

[0159] The emulsions, and the non-emulsified reaction mix are incubated in a 25°C water bath. 0.5 ml samples of 
the emulsion are removed immediately before the start of the incubation and after 2 hours and placed on ice. 25\i\ 
samples are removed from the non-emulsified control reactions at the same times. 

[0160] The emulsions are spun in a microfuge at 13,000 r.p.m. for 5 min. at 4°C and the mineral oil removed by 
aspiration, leaving the concentrated (but still intact) emulsion at the bottom of the tube. After briefly re-spinning and 
removing any further mineral oil the emulsion is broken and any further translation stopped by adding 1 0Ojil Buffer A 
( 1 00 mM Imidazole pH 7.0, 1 0 mM p-mercaptoethanol), containing 1 25u.g/ml puromycin and 1 ml water saturated diethyl 
ether. The mixture is vortexed and spun in a microfuge at 13,000 r.p.m. for 1 min. at 4°C. The ether and dissolved 
mineral oil is removed by aspiration and the extraction repeated with a further 1ml of ether. Any remaining ether is 
driven off by spinning for 5 minutes in a Speedvac at room temperature. 1 0OuJ Buffer A containing (125jig/ml) puromycin 
is also added to the 25uJ non-emulsified control reactions. 
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[0161] Dihydrofolate reductase activity is assayed as by spectrophotometrically monitoring the oxidation of NADPH 
to NADP at 340nm over a 10 minute time course as described by Williams et ai, 1979; Ma et ai, 1 993. 10u1 of each 
quenched in vitro translation reaction is added to 1 50uJ Buffer A (100 mM Imidazole.. pH 7.0, 10mM 0-mercaptoethanol) 
and 20u.l 1mM NADPH. 20uJ Dihydrofolate (1mM)(H 2 F) is added after 1 minute and the reaction monitored at 340nm 
using a ThermoMax microplate reader (Molecular Devices). Activity is calculated by initial velocities under So»K M 
conditions (x> max ). The background activity in the S30 extract is subtracted from all samples. 

[0162] DHFR activity generated in the emulsions is taken from the difference in activity measured at 0 hours and 2 
hours incubation. No increase in NADPH oxidation occurred between the 0 hour and 2 hour samples when 0.1 uA/l 
methotrexate (a specific inhibitor of DHFR) is added showing that all the increase in NADPH oxidation observed is due 
to DHFR produced in the in vitro translation reactions. 

[0163] Using 1 minute homogenisation at 20,000 rpm, the DHFR activity generated in the emulsions is 31% that 
found in the non-emulsified control reactions with 1 .3 molecules of DNA per microcapsule; 45% with 3.9 molecules of 
DNA per microcapsule; and 84% with 11.8 molecules of DNA per microcapsule. 

[0164] Using 5 minute homogenisation at 20,000 rpm, the DHFR activity generated in the emulsions is 7% that found 
in the non-emulsified control reactions with 0.3 molecules of DNA per microcapsule; 15% with 1 molecule of DNA per 
microcapsule; and 35% with 2.9 molecules of DNA per microcapsule, on average. 

[0165] Assuming the turnover number of DHFR is as described by Posner et ai, 1996, this corresponds to a yield 
at the highest DNA concentration of 6.3uxj (340pmole) DHFR per 100uJ reaction (non-emulsified control), 1.98ug 
(104pmole) DHFR per 100u.l reaction (emulsified for 1 min), or 0.46u.g (24.8pmo!e) per 1O0u.l reaction (emulsified for 
5 minutes). This equates to 74 molecules DHFR per microcapsule in the 1 minute emulsions and 44 molecules per 
microcapsule in the 5 minute emulsions (assuming that all microcapsules are of mean size). 

[0166] The DHFR activity resulting from coupled transcription/translation of fo/A genes is also measured in the larger 
microcapsules produced by stirring alone : or by stirring followed by further homogenisation with an Uftra-Turrax T25 
disperser at 8,000 rpm, 9,000 rpm, or 13,500 rpm for 1 minute as described in Example 1 . The results are presented 
in Figure 2b. The concentration of fo/A genes used (2.5 nM) gives an average of 1 , 1.5 and 4.8 genetic elements per 
droplet in the emulsions homogenised at 13,500 rpm, 9,500 rpm and 8,000 rpm, respectively, and an average of 14 
genetic element per droplet in the emulsion prepared by stirring only. Addition of sodium deoxycholate (0.5%) to the 
in vitro translation reaction mixture does not significantly affect the DHFR activity observed in the broken emulsions. 



30 Example 5. 



Linkage of an immobilised substrate into a genetic element via a high molecular weight protein. 

[0167] In order to link multiple immobilised substrate molecules to a DNA fragment comprising the folk gene, the 
DNA fragment is first biotinylated and then coupled to a complex of avidin with apoferritin. Horse spleen apoferritin is 
a large, near spherical protein molecule of 12.5nm diameter which therefore provides multiple sites which can be 
derivatised with substrate (e.g. the e-amino group of surface lysines). The pGEM fo/A plasmid encoding E. coU DHFR 
is PCR amplified using the primers LMB3 and S'-biotinytated LMB2 (LMB2-Biotin) to create a biotinylated 649bp DNA 
fragment carrying the T7 RNA polymerase promoter, the phage T7 gene 1 0 translational start site and the fotA gene 
(see Example 3). The DNA is radiolabeled by supplementing the 500u,l PCR reaction mix with 100u.Ci [ct- 32 P]dCTP 
(Amersham; 3000 Ci/mmol). The biotinylated PCR fragment is purified directly using Wizard PCR Preps (Promega) 
and the concentration determined spectrophotometrically. The percentage of DNA biotinylated is assayed by binding 
to Streptavidin M-280 Dynabeads (Dynal) and scintillation counting. 83% of the DNA is determined to be biotinylated 
using this technique. 

[0168] The sequestered iron is removed from a commercial conjugate of avidin and ferritin (Avidin-Ferritin; approx, 
1.1 mole ferritin per mole avidin; Sigma) by the overnight dialysis (4°C) of a solution of avidin-ferritin in PBS (1 mg/ml) 
against 0.1 2M thioglycollic acid, pH 4.25, followed by 24 hours dialysis against PBS (4°C) as described by Kadir and 
Moore, 1 990. Removal of iron is checked by analysis of the absorbance spectra (sequestered Fe(lll) absorbs strongly 
at 310-360nm). 

[0169] 0.3 pmole radiolabeled, biotinylated DNA is incubated with varying molar ratios of avidin-apoferritin in PBS 
(total volume 9uJ) for 30 minutes at room temperature. A 4.5jaI aliquot is removed and the percentage of DNA complexed 
with avidin-apoferritin assayed using band-shifting assay on a 1 .5% agarose gel as described by Berman et ai, 1 987. 
The gel is then dried and scanned using a Phosphorlmager (Molecular Dynamics). The percentage of DNA remaining 
unshifted (i.e. not complexed with avidin-apoferritin) is 1 7% (1 : 1 molar ratio avidin-apoferritin: DNA) , 1 5% (5: 1 molar 
ratio avidin-apoferritin:DNA) or 14% (25:1 molar ratio avidin-apoferritin: DNA). This means that even at a 1:1 ratio of 
avidin-apoferritin: DNA basically all the biotinylated DNA is bound. No band-shifting is observed when biotinylated DNA 
is mixed with apoferritin or when non-biotinylated DNA is mixed with avidin-apoferritin. 

[0170] The remaining 4.5uJ of DNA complexed with avidin-apoferritin is used as the template for a 25u.i in vitro tran- 
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scriptionAranslation reaction (E. coli S30 Extract System for Linear Templates; Promega). After 2 hours at 25°C, the 
reaction is stopped by adding 100uJ Buffer A containing puromycin (125u.g/ml). Dihydrofotate reductase activity is 
assayed as above by spectrophotometries!^ monitoring the oxidation of NADPH to NADP at 340nm over a 10 minute 
time course. 

5 [0171] 10uJ of each in vitro translation reaction is added to 150uJ Buffer A and 20uJ NADPH (1 mM). 20uJ Dihydrofolate 
( 1 mM) (Emulsions were broken and reactions were stopped with 0.5 ml EDTA (50 mM) and 2 ml water-saturated diethyl 
ether as described in Example 1) is added after 1 minute and the reaction monitored at 340nm using a ThenmoMax 
microplate reader (Molecular Devices). No difference in DHFR activity is found at even the highest ratio avidin-apof- 
erritin:DNA compared to a control with no avidin-apoferritin added. This indicates that the vast majority of DNA can be 

10 complexed without compromising the efficiency of in vitro translation. 

Example 6. 

Both in vitro transcription-translation and DHFR activity are compatible in the same system. 

15 

[0172] In order to select for the activity of DHFR produced in situ by coupled transcription -translation both the tran- 
scription-translation reaction and DHFR must be active in the same buffer system. 

[0173] A direct assay for DHFR activity in a complete E. coti in vitro translation system based on the spectrophoto- 
metrically monitoring of the oxidation of NADPH to NADP at 340nm is not practical due to the turbidity of the S30 
20 extracts. 

[0174] However, it is possible to ascertain that DHFR is active in the same buffer system as in vitro translation. E. 
coli DHFR is obtained by IPTG-induction of bacteria containing the plasmid pGEM-fo/A and affinity-purified on a meth- 
otrexate-Seph arose column (Baccanari etai, 1977). 

[0175] DHFR activity is compared in Buffer A as above or in an in vitro translation mixture complete except for the 
25 substitution of S30 dialysis buffer (Lesley 1995) (tOmM Tris-acetate pH8.0, 14mM magnesium acetate, 60mM potas- 
sium acetate, 1 mM DTT) for the S30 fraction. In each case the total reaction volume is 200^1 and the concentration of 
NADPH and Emulsions were broken and reactions were stopped with 0.5 ml EDTA (50 mM) and 2 ml water-saturated 
diethyl ether as described in Example 1 each 0.1mM. Reactions are monitored spectrophotometrically at 340nm. Ad- 
dition of 1 75pmole (1 3mUnits) E. coli DHFR gives initial rates of -25.77 mOD/min (in Buffer A) and -11.24 mOD/min 
30 (in translation buffer), hence the reaction is 44% as efficient in the translation buffer as in an optimised buffer (buffer A). 
[01 76] Furthermore, the presence of the substrates of DHFR (NADPH and H 2 F) at 0.1 mM concentration (either alone 
or in combination) does not cause any inhibition of the production of active DHFR from a 2 hour coupled transcription- 
translation reaction. 

35 Example 7. 

The activity of DHFR on a genetic element containing an immobilised dihydrofolate substrate leads to the 
formation of a tetrahydrofolate product linked to nucleic acid encoding DHFR. 

40 [0177] A peptide is synthesised comprising three glutamic acids linked via their y-caboxylates (using N-fluorenyl- 
methoxycarbonyl-giutamic acid a-benzyl ester as a starting material) with a lysine at the carboxy-terminus and biotin 
linked to its e-amino group by modifying published procedures (Krumdiek et ai, 1 980). Folic acid is linked at the amino- 
terminus and the benzyl and trifiuoroacetamide protective groups removed by alkaline hydrolysis as previously de- 
scribed. The peptide is purified by reverse phase HPLC and characterised by mass and UV spectroscopy. This folic 

45 acid peptide is chemically reduced to the corresponding dihydrofolic acid peptide (using dithionate and ascorbic acid) 
and then to the corresponding tetrahydrofolic acid peptide (using sodium borohydride) by applying published proce- 
dures (Zakrzewski et ai, 1980). These transformations are characterised by UV spectroscopy. 
[0178] A genetic element is constructed by linking, on average, two to three molecules of the folic acid peptide to 
avidin (or streptavidin) together with one molecule of the DHFR encoding, PCR-amplified DNA from the plasmid pGEM- 

50 fofA using primers LMB2-Biotin (SEQ. ID. No. 9) and LMB3 (see Example 3). The immobilised folic acid is chemically 
reduced to dihydrofolate using dithionate and ascorbic acid and purified by dialysis against buffer A. E. coli DHFR is 
obtained by IPTG induction of bacteria containing the plasmid pGEM-/o/A and affinity purified on a methotrexate-Sepha- 
rose column. E. coli DHFR is shown to react with the dihydrofolic acid immobilised to this genetic element by monitoring 
the oxidation of NADPH to NADP spectrophotometrically using 0-10 n_M of the avidin-linked dihydrofolic acid peptide 

55 and 0-50uM NADPH. Hence, at the end of this reaction, the product tetrahydrofolate is linked to the folA gene which 
encodes for the enzyme (i.e., DHFR) that catalyses its formation. 

[0179] To isolate those genes attached to the tetrahydrofolate product there are two approaches. The first involves 
the generation of phage-display antibodies specific for tetrahydrofolate (Hoogenboom, 1997). The second approach 
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is based on the use of a tagged reagent which reacts specifically with the immobilised product, but not with the substrate. 
We have synthesised a molecule consisting of a dinitrophenyl (DNP) tag linked to benzaldehyde via a 14 atom spacer. 
The aldehyde group reacts specifically with tetrahydrofolate to form a covalent adduct (Kallen and Jencks r 1966) and 
affinity purification can be performed using an anti-DNP antibody. 

Example 8. 

An alternative method of selecting for DHFR activity 



[0180] The DHFR-catalysed reaction can be selected for by in situ coupling to a second reaction, catalysed by Yeast 
Aldehyde Dehydrogenase, with a 'tagged* substrate. 

[0181] Instead of selecting for genes connected to one of the products of the DHFR reaction (5,6,7,8-tetrahydrofolate 
or NADP + ) the DHFR reaction is coupled to a second reaction. Selection is in this case is mediated by the formation 
of the second product of the DHFR-cataJysed reaction - nicotinamide adenine dinucleotide phosphate (NADP+). 
[0182] The reaction we have chosen to couple is catalysed by Yeast Aldehyde Dehydrogenase (YAD; EC 1 .2.1.5). 
This enzyme uses either NAD+ or NADP+ in the oxidation of a wide range of aliphatic and aromatic aldehydes to their 
corresponding carboxylics acids, generating NADH or NADPH in the process. The reaction has the big advantage of 
being essentially irreversible - namely, dehydrogenases (including DHFR and YAD) do not catalyse the reduction of 
the acid back to the aldehyde. Since a large number of enzymes catalysing redox reactions generate NAD* or NADP + 
the YAD reaction can be used in the selection of these enzymes too, and is not limited solely to selection for Dihydro- 
folate Reductase. 

[0183] A pentaldehyde substrate is synthesised and linked to a DNP (dinitrophenyl) tag via a C 20 linker (hereafter, 
DNP-PA). The oxidation of DNP-PA to the conesponding acid (DNP-PC) is followed and by HPLC (reverse phase C }Q 
column; H 2 0/CH 3 CN gradient + 0.1% trifluoroacetic acid; retention times: DNP-PA, 5.0 mins; DNP-PC, 4.26 mins). 
25 Conversion of DNP-PA to DNP-PC is observed only in the presence of both YAD and NADP+. Reactions are also 
followed spectrophotometrically; the increase of absorbance at 340nm indicated that NADP+ is simultaneously con- 
verted to NADPH. 

[0184] The coupled DHFR-YAD reaction is followed using the same HPLC assay. The initial reaction mixture con- 
tained the substrates for DHFR - NADPH (50 pM) and 7-8-dihydrofoJate (H 2 F; 50 U M), YAD (Sigma, 0.5 unit) and 
30 DNP-PA (50 u.M) in buffer pH 7.7 (100 mM imidazole, 5 mM 0-mercaptoethanol and 25 mM KCI). Conversion of DNP-PA 
to DNP-PC is observed when DHFR is added to the above reaction mixture (DHFR 5 nM. 83 %.; 1 .25 nM. 1 4.5 % after 
32 mins). 

[0185] The concentration of DHFR obtained in the compartmentalised in vitro translation is in fact much higher than 
5 nM (see Example 4). The conversion of DNP-PA to DNP-PC is negligible in the absence of DHFR, or when meth- 
otrexate (MTX) - a potent inhibitor of the enzyme - is present (1 0 uJvl). Hence, the formation of the secondary product, 
DNP-PC, is therefore linked to the presence of the DHFR. 

[0186] Using this coupled reaction, proteins conferring DHFR activity can be selected by: i) linking the genes to 
antibodies that specifically bind the carboxylic product of DNP-PA, and ii) isolating these genes by affinity purification 
using an anti-DNP antibody. 

[0187] This approach is demonstrated by a routine immuno assay based on the cat E LISA (Tawfik et aL, 1993). 
Microtiter plates are coated with anti-rabbit immunoglobulins (Sigma, 10 fig/well) followed by rabbit polyclonal serum 
that specifically bind glutaric acid derivatives (Tawfik et aL, 1993) diluted 1:500 in phosphate saline buffer + 1 mg/ml 
BSA). The plates are rinsed and blocked with BSA. The coupled reaction mixtures described above are diluted in Tris/ 
BSA buffer (50 mM Tris, .150 mM sodium chloride, 10 mg/ml BSA, pH 7.4) and incubated for 1 hr. The plate is rinsed 
and an anti-DNP antibody (mouse monoclonal SPE21 .11) diluted in the same buffer (1:1 0,000) is added and incubated 
for an hour. The plate is rinsed and peroxidase labelled anti mouse antibody (Jackson) is added followed by a perox- 
idase substrate (BM Blue; Boehringer Mannheim). A specific signal is observed only in the coupled reactions samples 
that contained DHFR (in addition to H 2 F, NADPH, YAD and DNP-PA). 

[0188] Highly specific anti-carboxylic acid antibodies (Tawfik et aL, 1993) are used for selection in two formats. 
[01 89] In the first, the anti-carboxylic acid antibody is coupled chemically to a high molecular weight avidin (or strepta- 
vidin) containing complex such as that described in Example 5. Biotinylated DNA encoding DHFR is coupled to this 
complex wathe avidin-biotin interaction as described in Example 5. This complex is then used in a compartmentalised 
coupled transcription/translation system which also contains YAD and a tagged YAD substrate such as DNP-PA. If 
there is DHFR activity in the compartment the DNP-PA is converted to DNP-PC. The anti-carboxylic acid antibodies, 
coupled to the DNA via the high molecular weight complex will capture only DNP-PC molecules and not aldehyde 
molecules. DNA from those compartments containing active DHFR (and hence encoding active DHFR if there is only 
one molecule of DNA per compartment) are then affinity purified by using anti-DNP antibodies. 
[01 90] In the second format multiple streptavidin molecules are coupled together in a high molecular weight complex 



21 

i 

BNSDCCID: <EP J 482036A2 J_> 



EP 1 482 036 A2 



which can easily be coupled to biotinylated DNA encoding DHFR (see Example 5). This complex is used in a compart- 
mentalised coupled transcription/translation system which also contains YAD and a YAD substrate such as MeNPOC- 
biotin-benzaldehyde. The biotin group in MeNPOC-biotin-benzaldehyde is "caged" (Sundberg etai., 1995; Pirrung and 
Huang, 1 996), that is, it cannot be bound by avidin or streptavidin until a photoremovable nitrobenzyi group'has been 

5 cleaved off by irradiation with light. If there is DHFR activity in the compartment the MeNPOC-biotin-benzaldehyde is 
converted to MeNPOC-biotin-benzoic acid. After the compartmentalised reaction has run for a while the reaction is 
irradiated with light and the nitrobenzyi group removed and the compound will bind to the streptavidin-DNA complex. 
DNA in those compartments containing active DHFR (and hence encoding active DHFR if there is only one molecule 
of DNA per compartment) is compiexed with biotin-benzoic acid (instead of biotin-benzaldehyde) and can be affinity 

*o purified using immobilised anti-benzoic acid antibodies. 

[0191] The presence of other enzymes which can catalyse the oxidation NAD + or NADP+ to NADH or NADPH in the 
in vitro transcription/translation system can under certain circumstances make it difficult to use this YAD system for 
selection directly in the compartmentalised in vitro transcription/translation system. In this case the selection is carried 
out using the two-step compartmentalisation system described earlier. That is, the DHFR is first translated in compart- 

'5 ments and then linked to the DNA in the same compartment by means of a suitable affinity tag. The emulsion is broken, 
the contents of the compartments pooled and the DNA affinity purified away from the other components of the tran- 
scription/translation system (including contaminating oxido-reductases), by using antibodies specific to a digoxigenin 
'tag' attached to one end of the DNA molecule. The purified DNA molecules ! together with the attached DHFR protein 
are then put into a reaction mixture contained the substrates for DHFR - NADPH (50 u,M) and 7-8-dihydrofolate (H 2 F; 

20 50 u.M), YAD (Sigma, 0.5 unit) and DNP-PA (50 \M) in buffer pH 7.7 (100 mM imidazole, 5 mM p-mercaptoethanol and 
25 mM KCI) and the reaction re-compartmentalised by emulsification to give only one, or at most a few, molecules of 
DNA per compartment. Anti-carboxylic acid antibodies (Tawfik et al., 1 993) are used for selection in either of the two 
formats described above. 

25 Example 9. 

Methylation of genetic elements by gene products 

[0192] DNA methyltransferases, produced by in vitro transcription/translation in the aqueous compartments of a 
30 water-in-oil emulsion, methylate the DNA molecules which encode them in the compartments. 

[0193J Selecting proteins with binding or catalytic activities using the compartmentalisation system described here 
presents two basic requirements: i) a single molecule of DNA (or at most a few molecules) encoding the proteins to 
be selected is expressed in a biologically active form by a coupled transcription/translation system in the aqueous 
compartments of a water-in-oil emulsion; and, ii) the protein to be selected must be able to modify the genetic element 
35 that encoded it in such a way as to make it selectable in a subsequent step. In this Example, we describe a group of 
proteins - DNA methyl transferases (type II) - that are produced efficiently in the aqueous compartments of a water- 
in-oil emulsion system using a coupled transcription/translation system. Furthermore, the in vitro translated DNA meth- 
yltransferases efficiently modify the DNA molecules which encode them in situ in the aqueous compartments so that 
they can be selected and amplified. The target sites on the DNA molecules are modified by methylation of a cytosine 
*o at the C5 position which renders the sites resistant to cleavage by the cognate restriction endonuclease (i.e. Hha\ tor 
M.Hhal and HaelH for M. Haelll). Hence, methylated DNA is selectable over non-methylated DNA by virtue of its re- 
sistance to restriction endonuclease cleavage. 

[0194] The gene encoding M.Hhal is amplified by PCR using oligonucleotides Hhal-Fo2S and Hhal-Bc directly from 
Haemophilus parahaemolyticus (ATCC 10014). The gene encoding M. Haelll is amplified by PCR using oligonucle- 

45 otides Haelll-Fo2s and Haelll-Bc (SEQ. ID. No. 4) directly from Haemophilus influenzae (biogroup aegyptius) (ATCC 
11116). Both PCR fragments are cloned into the vector pGEM-4Z (Promega) digested with Hind\\\ and Kpn\ downstream 
of the lac promoter andT7 RNA polymerase promoter. The oligonucleotides Hhal-Bc and Haelll-Bc (SEQ. ID. No. 4) 
append the efficient phage T7 gene 10 translation^ start site upstream of the methyltransferase gene start codon. 
Oligonucleotide Hhal-Fo appends an Hha\ methylation/restriction site (M/R) and a Haelll (/A/ofl) site to function as 

50 substrates for M.Hhal and M.Haelll respectively. Oligonucleotide Haelll-Fo appends a Not\/Hae\\\ M/R site which func- 
tions as a substrate for M.Haelll (the M. Haelll gene already contains two internal Hha\ M/R sites). DNA sequencing 
identifies clones with the correct nucleotide sequence. 

[0195] The pGEM-M.Hfial and pGEM-M. Haelll plasmids described above are amplified by PCR using primers 
LMB2-Biotin (SEQ. ID. No. 9) and LMB3-DIG (SEQ. ID. NO. 10) as above to create either 1167 base pair DIG-M. 
55 Hhal-Biotin or a 11 71 base pair DIG-M. Haelll-Biotin DNA fragment, labelled at one end by biotin and the other end by 
digoxigenin, and which carry the T7 RNA polymerase promoter, the phage T7 gene 1 0 translational start site, the 
methyltransferase gene and M/R sites of Haelll and Hhal. The PCR fragments are each purified directly using Wizard 
PCR Preps (Promega). 
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[0196] The genes required for the coupled in vitro transcript ion -translation of M EcoRI and M.EcoRV are amplified 
by PCR using piasmids pMBI (Betlach eta!., 1976) and pLB1 (Bougueleret et at., 1984) respectively, as templates, a 
back primer appending the phage T7 gene 10 translational start site and LMB3 upstream of the methyltransferase 
gene ribosome binding site (EcoRI-Bc or EcoRV-Bc) and a forward primer (EcoRI-Fo or EcoRI-Fo) appending LMB2. 
These fragments are further amplified by PCR using primers LMB2-Biotin (SEQ. ID. No. 9) and LMB3-DIG (SEQ. ID. 
NO. 10) as described above to create the DIG-M.Eco Rl-Biotin and DIG-M.EcoRV- Biotin DNA fragments which carry 
the T7 RNA polymerase promoter, the phage T7 gene 70translationaJ start site, the methyltransferase gene and M/R 
sites of EcoRI and EcoRV. These PCR fragments are each purified directly using Wizard PCR Preps (Promega). 
[0197] The PCR-amplified DNA-methylases genes described above are expressed in a prokaryotic in vitro coupled 
transcription/translation system designed for linear templates (Lesley et at., 1991). A commercial preparation of this 
system is used (E. coti S30 Extract System for Linear Templates; Promega) supplemented with T7 RNA polymerase 
and S-adenosyl methionine (SAM) at 80 jiM concentration. 

[0198] Methylation is assayed by measuring the resistance of DNA fragments labelled with DIG and biotin to cleavage 
by the cognate restriction enzyme using the Boehringer-Mannheim DIG-Biotin EL ISA or with radioactively labelled 
DNA fragments and streptavidin coated magnetic beads. In vitro reaction mixtures containing DIG-Biotin labelled frag- 
ments reacted in situ by coupled in vitro transcription-translation as described below are diluted in 1 xW&B buffer (1 M 
NaCI, 10 mM Tris, 1 mMEDTA, pH 7.4) + 0.1% Tween-20 (the concentration of the DIG/Biotin labelled DNA in the 
assay is in the range of 0-250 pM) and incubated in streptavidin coated microtiter plates (high capacity) for 30-60 mins. 
The plate is rinsed (3 times 2xW&B and finally with 50 mM Tris pH 7.4 + 5 mM MgCI 2 ) and the restriction enzymes 
(NEB) are added (10-50 units enzyme in 0.2 ml of the corresponding buffer) and incubated at 37° for 3-12 hrs. The 
plate is rinsed and peroxidase-linked anti-DIG antibodies (diluted 1:1,500 in PBS + 0.1% Tween-20 + 2 mg/ml BSA) 
are added for 40-60 min followed by the peroxidase substrate (BM Blue; 70 fil/well). The absorbance (at 450 minus 
650nm) is measured after quenching with 0.5M H 2 S0 4 (130 jil/well). 

[0199] For the radioactive assay, the piasmids and PCR fragments described above are amplified by PCR using 
primers LMB2-Biotin (SEQ. ID. No. 9) and LMB3 and oc-P^-CTP to give P^-labelled DNA fragments labelled at one 
end by biotin and which carry the T7 RNA polymerase promoter, the phage T7 gene 10 translational start site, the 
methyltransferase gene and the relevant M/R sites. These PCR fragments are purified directly using Wizard PCR 
Preps (Promega). Reaction mixtures containing the Biotin- P32.| a belled DNA reacted in situ by coupled in vitro tran- 
scription-translation are diluted in 1xW&B buffer + 0.1% Tween-20 and incubated with streptavidin coated magnetic 
30 beads (Dynal, M-280; 1-5 x10 6 beads) for 30-60 mins. The beads are separated and rinsed (3 times 2xW&B + 0.1% 
Tween-20 + 3% BSA and finally with 50 mM Tris pH 7.4 + 5 mM MgCI 2 ). The restriction enzymes (NEB) are added 
(10-50 units enzyme in 50-150 \i\ ofthe corresponding buffer) and incubated at 37° for 5-20 hrs. The supernatant is 
removed and the beads rinsed and resuspended in 100 u,l water. The amount of radioactively-labelled DNA on the 
beads and in the supernatants is determined by scintillation. 

[0200] All four methylases described here - M.Haelll, M.tfhal, M.EcoR and M.EcoRV - are expressed and active in 
the in wfrocoupled transcription/translation. Furthermore, the in vitro translated methylase can methylate its own gene 
thus rendering it resistant to cleavage by the cognate methylase (self-methylation). Both processes, the coupled in 
vitro transcription-translation of the methylase gene as well as its methylation proceed efficiently in the same reaction 
mixture. More specifically, DNA fragments (at 0.5 to 10 nM concentrations) which carry the T7 RNA polymerase pro- 
moter, the phage T7 gene 10 translational start site, a methyltransferase gene and M/R sites of all four methylases 
become resistant to cleavage by the cognate restriction endonuclease. For example, the DNA fragment encoding M. 
EcoRI methyltransferase becomes resistant to cleavage by EcoRI (75-100% after 20-90 minutes at 25°C) when incu- 
bated with E. coti S30 Extract System for Linear Templates (Promega), SAM (80 uJvl) and T7 RNA polymerase. The 
resistance to cleavage as a result of methylation is selective and specific: under the same conditions, resistance to 
cleavage by Hha\ or M. EcoRV is not observed; moreover, resistance to cleavage by EcoRI is not observed when 
translation is inhibited (e.g. in the presence of puromycin or in the absence of T7 RNA polymerase). Similar results 
where obtained when survival of the genes is assayed by DIG-Biotin ELISA or with Biotin-P 32 -labelled DNA fragments 
as described above. Methylation in trans, i.e., of DNA fragments (other than those encoding for the cognate methylase) 
appending M/R sites is also observed in the E. coli S30 coupled in vitro transcription-translation system in the presence 
50 of a gene encoding for a methylase. 

[0201] Both processes, the coupled in vitro transcription-translation of the methylase genes as well as their self- 
methylation proceed efficiently in the aqueous compartments of a water-in-oil emulsion. More specifically, DNA frag- 
ments (at 0. 1 - 1 0 nM concentrations) which carry the T7 RNA polymerase promoter, the phage T7 gene 10 translational 
start site, the methyl transferase gene (for example, M.Hnal) and the M/R sites of Haelll, Hha\ and EcoRI are added 
55 to E. co//S30 Extract System for LinearTemplates (Promega) in the presence of SAM (80 \xU) andT7 RNA polymerase. 
The ice cooled reaction mixtures are emulsified by homogenising for 1 minute with an Ultra-Turrax T25 disperser at 
20,000 rpm as described in Example 1 and incubated at 25° -30° for 0-180 mins. The reaction is stopped and the 
aqueous phase is separated (see Example 1) and the methylation of the DIG-Biotin or Biotin -P 32 - labelled DNA frag- 
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ments is assayed as described above. Methylation of up to 20% of the compartmentalised genes to cleavage by Hha\ 
is observed after 60-180 mins incubation. No resistance is observed when the ice-cold emulsion is broken just after it 
is made and the reaction quenched by ether extraction ('0 mins'). The methylation is selective: under the same condi- 
tions, resistance to cleavage by Haelll or EcoRI is not observed. Moreover, the assay of P^-labelled DNA fragments 
has shown that self-methylation of both M.Haelll and M.Hnal proceed at concentrations of genes that correspond to 
an average of less than one gene per compartment (0. 1 -0.5 nM; see Example 4). Thus, the coupled in vitro transcription- 
translation of the methylases genes as well as their self-methylation proceed efficiently even when only a single genetic 
element is present in aqueous compartments of the water-in-oil emulsion. 

[0202] Haelll methylase activity resulting from coupled transcription/translation of M.Haelll genes is also measured 
in the larger microcapsules produced by stirring alone, and by stirring followed by further homogenisation with an Ultra- 
Turrax T25 disperser at 8,000 rpm, 9,000 rpm, or 1 3,500 rpm for 1 minute as described in Example 1 . The results are 
presented in Figure 2b. The concentration of M.Haelll genes used (2.5 nM) gives an average of 1 : 1 .5 and 4.8 genetic 
elements per droplet in the emulsions homogenised at 13,500 rpm, 9,500 rpm and 8,000 rpm, respectively, and an 
average of 14 genetic elements per droplet in the emulsion prepared by stirring only. The addition of an anionic sur- 
factant - e.g. , sodium deoxycholate, typically to 0.5% (w/v), to the in vitro translation mixture significantly increases the 
percentage of genetic elements methylated in the emulsions. 



Example 10. 

Genetic elements encoding DNA methyltransferases can be selected and amplified following their self- 
methylation in the aqueous compartments of a water-in-oil emulsion. 

[0203] The methylation of genes encoding for DNA-methylases allows them to be isolated and amplified in a sub- 
sequent step. The methylated DNA is selectable over non-methylated DNA by virtue of its resistance to restriction 
endonuclease cleavage. Thus, the genetic elements that remain intact after treatment with the cognate restriction 
enzyme can be amplified by PCR. However, such a selection is obviously unattainable if other genes that contain the 
same R/M site but do not encode for the methylase are present in same reaction mixture. This is because cross- 
methylation of the non-methylase genes (that are present at a large excess) will render them resistant to cleavage by 
the cognate restriction enzyme and thus amplifiable by PCR. Under these conditions, selection of genes encoding the 
methylase will become possible only if they are compartmentalised - namely, if only one, or few genes are present in 
a single compartment so that self methylation is the major process in that compartment. Cross-methylation is avoided 
since non-methylase genes that are present in compartments that do not contain a methylase gene will remain un- 
methylated. 

[0204] The genes used in the experiment are a 1 1 94 base pair M.Haelll fragment (DIG-M.Hae!ll-3s-Biotin) encoding 
methylase Haelll and a 681 base pair folA fragment (DIG fo/A-3s-Biotin) encoding the enzyme dihydrofolate reductase 
(DHFR) containing additional Haelll and Hha\ restriction/modification sites (See Fig. 1b). Both DNA fragments are 
labelled at one end with digoxigenin (DIG) and the other with biotin, and contain a T7 RNA polymerase promoter (T7 
Promoter) and T7 gene /Otranslational initiation site (rbs) for expression in vitro. 

[0205] pGEM-4Z-3s is created by annealing oligonucleotides HaeHha-PI and HaeHha-Mi (SEQ. ID. No. 2) (Table 1 ) 
and ligating them into H/ndlll and EcoRI cut pGEM-42 (Promega). The M.HaelU gene is amplified by PCR from Hae- 
mophilus influenzae (biogroup aegyptius) (ATCC 11116) using oligonucleotides Haelll-FoNC (SEQ. ID. No. 3) and 
Haelll-Bc (SEQ. ID. No. 4) (Table 1 ). The fo/A gene is amplified from Escherichia coli using primers EDHFR-Fo (SEQ. 
ID. No. 5) and EDHFR-Ba (SEQ. ID. No. 6) (Table 1). Both amplified genes are digested with H/ndlH and Kpn\ and 
cloned into pGEM-4Z-3s, creating the expression vectors pGEM-Haeill-3s and pGEM fojA-3s. DIG-M.Haelll-3s-Biotin 
and DIG-to/A-3s-Biotin (see Fig. 1b) are amplified from these vectors by PCR with Pfu polymerase using primers 
LMB2-Biotin (SEQ. ID. No. 9) and LMB3-DIG (SEQ. ID. NO. 10) (20 cycles) and purified using Wizard PCR Preps 
(Promega). DIG-D1.3-Biotin, a 942 bp DNA fragment containing four Haelll R/M sites used as a substrate to assay 
Haelll methylase activity, is amplified from a pUC 1 9 derivative containing a D1 .3 single-chain Fv gene (McCafferty et 
at., 1 990) as above. A 558 bp carrier DNA (g3 carrier DNA; an internal fragment of phage fd gene 111 which has no T7 
promoter, Haelll or Hha\ R/M sites) is amplified by PCR with Taq polymerase from pHEN1 DNA (Hoogenboom et a\., 
1991) using primers G3FRAG-Fo (SEQ. ID. No. 11) and G3FRAG-Ba (SEQ. ID. No. 12) (Table 1) and purified by 
phenol-chloroform extraction and ethanol precipitation. This DNA (at >10 nM) was used as a carrier in dilution of all 
DNA used for the reactions in this example. 

[0206] Figure 3 demonstrates the selection of M.Haelll genes encoding the DNA methylase Haelll from an excess 
of fo/A genes (encoding DHFR which does not methylate DNA). Both genes have the same Haelll R/M sequences 
appended to act as a substrate (Fig. 1b). After translation in the aqueous compartments of an emulsion the Haelll R/ 
M sequences attached to methylase genes are methylated. These genes are rendered resistant to cleavage by Haelll 
endonuclease and are subsequently amplified by PCR. fo/A genes, present in other compartments, remain unmethyl- 
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ated, are cleaved and not amplified. The PCR products are analysed by agarose gel electrophoresis where enrichment 
for the M.Haelll genes can be visualised by the appearance of a 1194 bp band (Fig. 2b). 

[0207] The £ coli S30 extract system for linear DNA (Promega) is used, supplemented with g3 carrier DNA (1 0 nM), 
DNA fragments (DtG-M.Haelll-3s-Biotin and DIG-M.fo/A-3s-Biotin at ratios and concentrations indicated below), T7 
RNA polymerase (10 4 units), sodium deoxycholate (Fluka, 0.5% w/v; in emulsified reactions only) and S-adenosyl 
methionine (NEB, 80 uJvt). Reactions are set up using DNA fragments DIG-M.Haelll-3s-Biotin and DIG-M.fo/A-3s-Biotin 
at a ratio of 1 :1 0 3 and at a total concentration of 200 pM (Fig. 3a) and ratios of 1 : 1 0 4 to 1 : 1 0 7 and a total concentration 
of 500 pM (Fig. 3b). Fifty microliter reactions are prepared on ice and emulsified by stirring only as described in Example 
1 Reactions are incubated for 2 hours at 25°C. To recover the reaction mixtures, the emulsions are spun at 3,000 g 
for 5 minutes and the oil phase removed leaving the concentrated emulsion at the bottom of the vial. Quenching buffer 
(0<2 ml of 25 jig/ml yeast RNA in W+B buffer: 1 M NaCI, 10 mM Tris-HCI, 1 mM EDTA pH 7.4) and 2 ml of water- 
saturated ether are added and the mixture is vortexed, centrifuged briefly., and the ether phase removed. The aqueous 
phase is washed with ether and dried (5 minutes in a Speedvac at ambient temperature). DNA is captured on 1 00 \ig 
M-280 streptavidin Dynabeads (2 hours at ambient temperature). The Dynabeads are washed sequentially with: W+B 
buffer; 2.25 M Guanidine-HCI, 25 mM Tris-HCI, pH 7.4; W+B buffer; and, twice with restriction buffer. Beads are re- 
suspended in IOOuJ restriction buffer containing 10 units Haelll (or Hha\) and incubated at 37°C for 5 hours. The beads 
are washed three times with W+B buffer, twice with 50 mM KCI, 1 0 mM Tris-HCI, 0.1% Triton X-1 00, pH 9.0, and then 
resuspended in 100 uJ of the same buffer supplemented with 1 .5 mM MgCI 2 (PCR buffer). Altquots of beads (2-20 uJ) 
are amplified by PCR using Taq polymerase added at 94*C with primers LMB2-Biotin and LMB3-DIG (50 uJ reactions; 
32 cycles of 1 minute at 94°, 1 minute at 55°, 2 minutes at 72°). This DNA is purified using Wizard PCR Preps and 
used for the second round of selection (20 pM in the 1:10 4 and 1:1 0 5 selections and 500 pM in the 1:10 6 and 1:10 7 
selections). For gel electrophoresis and activity assays this DNA (diluted to -1pM) is further amplified with primers 
LMB2-Nest and LMB3-Nest which anneal immediately inside LMB2 and LMB3 respectively (25 cycles of 1 minute at 
94°, 1 minute at 50°, 1 .5 minutes at 72°) and purified as above. This DNA (at 10 nM), which has neither DIG nor Biotin 
appended, is also translated in vitro in the presence of 10 nM DIG-D1 .3-Biotin, a 942 bp DNA containing four Haelll 
R/M sites. Methylation of the DIG-D1 .3-Biotin substrate is determined by DIG-Biotin ELISA as Example 9. 
[0208] A single round of selection of a 1 :1 000 ratio of M.Haelll : folk genes in the emulsion results in a roughly 1 :1 
final gene ratio (Fig. 3a). Several control experiments indicate that selection proceeds according to the mechanism 
described above: a band corresponding to the M.Haelll gene is not observed when the initial mixture of genes is 
amplified by PCR; nor after reaction in solution (non-emulsified); nor when emulsified in the absence of transcription/ 
translation (when T7 RNA polymerase is omitted): nor when the reacted genes are cleaved at R/M sites other than 
those of Haelll e.g., after digestion with Hha\. The yield of M.Haelll DNA after selection is less than 100% primarily 
due to incomplete digestion by Haell I rather than cross-methylation as indicated by the large foiA band observed in 
the absence of methylase activity (when T7 polymerase is not added). During digestion, the concentration of DNA 
drops well below the K M of Haelll (6 nM) and digestion becomes extremely inefficient. 

[0209] A band corresponding to M.Haelll genes also becomes visible after a single-round of selection starting from 
M.Haelll: folA ratios of 1:10 4 to 1:10 5 (Fig. 3b), but not at lower ratios, indicating an enrichment factor of at least 
5000-fold. Selection of a small number of genes from a large pool (e.g., a gene library) therefore requires further rounds 
of selection. When the Haelll-digested and amplified DNA from the first round of selection is subjected to a second 
round of selection, a band corresponding to M.Haelll genes also became visible from 1:10 6 and 1 :10 7 starting ratios 
of M.Haelll: fo/A. A second round of selection is also performed on the DNA derived from the 1:10 4 to 1:10 5 starting 
ratios of M.Haelll: fo/A. This gives a further enrichment, up to a ratio of approximately 3:1 in favour of the M.Haelll 
genes. Before and after each round of selection the genes are amplified, translated in vitro and reacted with a separate 
DNA substrate to assay for Haelll methylase activity. These assays indicate that enrichment for the M.Haelll genes 
as observed by gel electrophoresis results in a parallel increase in Haelll methylase activity (Fig. 3b). 
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SEQUENCE LISTING 
45 [0211] 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

50 

(A) NAME: MEDICAL RESEARCH COUNCIL 

(B) STREET: 20 PARK CRESCENT 

(C) CITY: LONDON 
(E) COUNTRY: UK 

55 (F) POSTAL CODE (ZIP): WIN 4AL 

(ii) TITLE OF INVENTION: IN VITRO SORTING METHOD 
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(in) NUMBER OF SEQUENCES: 23 
(iv) COMPUTER READABLE FORM: 

5 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

10 (2) INFORMATION FOR SEQ ID NO: 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 
'5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 
25 (iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

3Q AGCTTGCATG CCTGCGGTAC CGGCCATGCG CATGGCCTAG CGCATGCGGC CGCTAGCGCG 60 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

35 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc= "SYNTHETIC OLIGONUCLEOTIDE" 
45 (ijj) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

50 

AATTCGCGCT AGCGGCCGCA TGCGCTAGGC CATGCGCATG GCCGGTACCG CAGGCATGCA 60 

55 (2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 
10 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



15 



20 



40 



45 



CGAGCTAGAG GTACCTTATT AATTACCTTT ACAAATTTCC AATGCAGATT TTAT 54 

(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

30 (A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



GCATCTGACA AGCTTAATAA TTTTGTTTAA CTTTAAGAAG GAGATATACA TATGAATTTA 60 
ATTAGTCTTT TTTCAGGTGC AGGG 34 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

55 (A) DESCRIPTION: /desc= "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



CGAGCTAGAG GTACCTTATT ACCGCCGCTC CAGAATCTCA AAGCAATAG 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GCATCTGACA AGCTTAATAA TTTTGTTTAA CTTTAAGAAG GAG AT AT ACA TATGATCAGT 
CTGATTGCGG CGTTAGCGGT AG 

(2) INFORMATION FOR SEQ ID NO: 7: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GAATTGGATT TAGGTGAC 

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 



36 



_14fi2036A2J_> 



EP 1 482 036 A2 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

CATGATTACG CCAAGCTC 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GTAAAACGAC GGCCAGT 
(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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CAGGAAACAG CTATGAC 17 

(2) INFORMATION FOR SEQ ID NO: 11: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

10 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

'5 (A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 : 
GTCTCTGAAT TTACCGTTCC AG 22 

25 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

30 (A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



40 



45 



GAAACTGTTG AAAGTTGTTT AG 



(2) INFORMATION FOR SEQ ID NO: 13: 
so (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 
(iii) HYPOTHETICAL: NO 
5 (iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



10 ATTATAATAC GACTCACTAT AGGGAGAGTT ATCAGGCATG CACC 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

15 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 
25 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 14: 

30 

CTAGCTCCCA TTAAGGAG 



(2) INFORMATION FOR SEQ ID NO: 15: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

45 (A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



GTAAAACGAC GGCCAGT 



(2) INFORMATION FOR SEQ ID NO: 16: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

™ (A) DESCRIPTION: /desc = -SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



15 



CAGGAAACAG CTATGAC !0 

20 

(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

25 (A) LENGTH: 65 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (») MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

35 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

40 

CGAGCTAGAG GTACCGCGGC CG CTGCGCTT ATTAATATGG TTTGAAATTT AATGATGAAC 60 
CAATG 65 

45 

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

50 (A) LENGTH: 83 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GCATCTGACA AGCTTAATAA TTTTGTTTAA CTTTAAGAAG GAGATATACA TATGATTGAA 
ATAAAAGATA AACAGCTCAC AGG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc= "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CGAGCTAGAG GTACCGCGGC CGCTGCGCTT ATTAATTACC TTTACAAATT TCCAATGCAG 
ATTTTAT 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
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CAGGAAACAG CTATGACAAG CTTAATACGA CTCACTATAG GGAGATATTT TTTATTTTAA 60 

5 

TAAGGTTTTA ATTAATGG 78 

(2) INFORMATION FOR SEQ ID NO: 21: 
10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
'5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

20 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 : 



GTAAAACGAC GGCCAGTGAA TTCTTATTAC TTTTGTAATC GTTTGTTTTT TATC 54 

30 

(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

35 (A) LENGTH: 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

45 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

so 

CAGGAAACAG CTATGACAAG CTTAATACGA CTCACTATAG GGAGAAATGG GTTTCTTTGG 60 
CATATTTTTT ACAAATG 77 

55 

(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



GTAAAACGAC GGCCAGTGAA TTCGATATCT TATTACTCTT CAATTACCAA AATATCCCC 



25 



30 



35 4 



50 



55 



Claims 

1 . A method for increasing the concentration of a nucleic acid molecule, said method comprising: 

(a) forming aqueous microcapsules from a water-in-oil emulsion, wherein a plurality of the microcapsules 
include a nucleic acid molecule and an aqueous solution comprising components necessary for nucleic acid 
amplification; and 

(b) amplifying the nucleic acid molecule in the microcapsules to form further amplified copies of said nucleic 
acid molecule. 

2. A method as claimed in claim 1 , further comprising coupling said nucleic acid to a solid-phase support. 

3. A method as claimed in claim 2, wherein said solid-phase support comprises a bead. 
A method as claimed in claim 3, wherein said bead is a polystyrene or magnetic bead. 



5. A method as claimed in claim 2 or claim 3, wherein said bead comprises a coating selected from avidin and 
streptavidin. 



40 6. 



A method as claimed in of claim 5, wherein the nucleic acid molecule comprises a biotin tag. 



7. A method as claimed in any preceding claim, wherein said nucleic acid amplification is performed using RNA 
polymerase, Q|3 replicase amplification, ligase chain reaction, self-sustained sequence replication or strand dis- 
placement amplification. 

8. A method as claimed in any of claims 1 to 6, wherein said nucleic acid amplification is performed using polymerase 
chain reaction. 

9. A method as claimed in any preceding claim, wherein said emulsion includes at least one emulsion stabilizer. 

10. A method as claimed in claim 9, wherein said emulsion stabilizer is a non-ionic surfactant. 

11. A method as claimed in of claim 10, wherein said emulsion stabilizer is selected from sorbitan monooleate and 
polyoxyethylenesorbitan monooleate. 

12. A method as claimed in of claim 9, wherein the emulsion stabilizer is an anionic surfactant. 

13. A method as claimed in of claim 12, wherein the emulsion stabilizer is selected from sodium cholate, sodium 
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glycocholate, sodium taurocholate, and sodium deoxycholate. 



14. A method as claimed in any preceding claim, wherein the emulsion is thermostable. 

5 15. A method as claimed in claim 2, wherein said solid-phase support is a bead, said nucleic acid amplification is 
performed using the polymerase chain reaction, and the emulsion is thermostable. 

16. A method as claimed in any preceding claim, wherein a plurality of microcapsules when formed each contains on 
average one or less than one nucleic acid molecule. 

w 

17. A method as claimed in any of claims 1 to 15, wherein a plurality of microcapsules when formed each contains on 
average between 5 and 1000 nucleic acid molecule. 
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